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(57) Abstract 

A recombinant DNA molecule adapted 
for transfection of a host cell comprising a nu- 
cleic acid molecule encoding mammalian ery- 
thropoietin, an expression control sequence op- 
eratively linked thereto and at least one SAR 
element. The invention also relates to expres- 
sion vectors having the recombinant DNA mol- 
ecule and to mammalian cells transformed with 
the expression vector. The mammalian cells 
lack multiple copies of an amplified amplifica- 
tion gene and are capable of expressing recom- 
binant EPO in vitro at levels of at least 1.500 
u/10 6 cells in 24 hours. The invention fur- 
ther relates to a method of expressing recom- 
binant mammalian erythropoietin using the ex- 
pression vectors and to a transgenic non-human 
animal or embryo whose germ cells and so- 
matic cells contain a DNA construct having 
the recombinant DNA molecule of the inven- 
tion. 
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Title: Recombinant DNA Molecules and 

Expression Vectors for Erythropoietin 

FIELD OF THE INVENTION 

The invention relates to recombinant DNA molecules 
5 adapted for transfection of a host cell, and having a 
nucleic acid molecule encoding mammalian erythropoietin, 
operatively linked to an expression control sequence and 
having at least one SAR element. The invention also 
relates to expression vectors for transfection of a host 
10 cell and to host cells for expressing erythropoietin. 
The invention further relates to methods of preparing 
recombinant erythropoietin using the host cells 
transfected with the expression vectors. 

BACKGROUND OF THE INVENTION 

15 Erythropoietin (EPO) is a heavily glycosylated 

acidic glycoprotein with a molecular weight of 
approximately 35,000. The protein consists of 166 amino 
acids and has a leader signal sequence of 27 amino acids 
which is removed in vivo during secretion from the host 

20 cell. The sequence encoding the unprocessed EPO is 579 
nucleotides in length (Jacobs et al, 1985, Lin et al f 
1985, U.S. Patent 4,703,008 and WO 86/03520). 

Erythropoietin is the principal hormone involved in 
the regulation and maintenance of physiological levels of 

2 5 erythrocytes in mammalian circulation and functions to 

promote erythroid development, to initiate hemoglobin 
synthesis and to stimulate proliferation of immature 
erythroid precursors. The hormone is produced primarily 
by the adult kidney and foetal liver and is maintained in 
30 the circulation at concentrations of about 10-20 
milliunits/ml of serum under normal physiological 
conditions. Elevated levels of EPO, induced by tissue 
hypoxia, trigger proliferation and differentiation of a 
population of receptive progenitor stem cells in the bone 

3 5 marrow, stimulating hemoglobin synthesis in maturing 

SUBSTITUTE SHEET (RULE 26) 
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erythroid cells and accelerating the release of 
erythrocytes from the marrow into the circulation. 

Recombinant EPO has been used to successfully treat 
patients, including patients having anemia as a result of 
5 chronic renal failure. As EPO is the primary regulator of 
red blood cell formation, it has applications in both the 
diagnosis and treatment of disorders of red blood cell 
production and has potential applications for treating a 
range of conditions. 

10 The urine of severely anaemic patients was, at one 

time, almost the sole source for the commercial isolation 
of EPO. U.S. Patent No. 3,033,753 describes a method for 
obtaining a crude EPO preparation from sheep plasma. The 
preparation of monoclonal antibodies specific for human 

15 EPO provided a means for identifying EPO produced from EPO 
mRNA, for screening libraries and for cloning the EPO 
gene. Human EPO cDNA has been cloned and expressed in E. 
coli (Lee-Huang, 1984, Proc. Natl. Acad. Sci. 81:2708). 
Isolation of the human EPO gene using mixtures of short or 

20 long synthetic nucleotides as probes led to the expression 
of biologically active EPO in mammalian cells (Lin, 1985, 
Proc. Natl. Acad. Sci. 82:7580; Lin, WO 85/02610; Jacobs, 
et al., 1985, Nature (Lond. ) 313:806; Goto et al . , 1988, 
Biotechnology 6:67). Jacobs, et al . , 1985, supra, 

25 described the use of plasmids containing EPO DNA which 
were not integrated into the chromosomes of the COS host 
cells, but replicated autonomously in the cells to many 
thousands of copies, thereby killing the cells. Thus the 
expression of EPO was only a transient phenomenon in these 

30 cells. 

Lin, in U.S. Patent No. 4,703,008, reported 
expression of the human EPO gene in COS-1 and CHO cells. 
However, attempts to use transfected cells as production 
vehicles for EPO have been hampered by the low levels of 
3 5 EPO expressed by transfected cells. Given the important 
applications of recombinant EPO, there is much interest in 
developing more efficient methods for the expression of 



BNSDOCIO: <WO 961 9573A1 _l_> 



WO 96/19573 



PCT/CA95/00696 



EPO. 

Lin in U.S. Patent No. 4,703,008 reported methods to 
increase the low amounts of EPO produced by trans fected 
CHO cells (e.g. 2.99u/ml/3 days) by a process of gene 
5 amplification. Levels of approximately 1500 units EPO/10 6 
cells/48 hours were reported by Lin, following 
amplification. Gene amplification involves culturing cells 
in appropriate media conditions to select cells resistant 
to a selective agent, such as the drug methotrexate. 

10 Selection for cells resistant to methotrexate produces 
cells containing greater numbers of DHFR genes and 
passenger genes, such as the EPO gene carried on the 
expression vector along with the DHFR gene or transfected 
with the DHFR gene. 

15 However, gene amplification is a very time consuming and 
labour intensive process. A major disadvantage of 
amplification is the inherent instability of amplified 
genes (McDonald, 1990, Crit. Rev. Biotech. 10:155). As it 
is usually necessary to maintain the amplified cells in 

2 0 the presence of toxic analogs to maintain high copy 
number, amplification may be inappropriate for large scale 
production due to the costs and toxicity of the selective 
agent. The high copy number of the DHFR-target transgene 
may also sequester transcription factor, leading to a 

25 retardation of cell growth. 

Genomic clones of human EPO have been used in 
attempts to develop stably transfected mammalian cell 
lines that secrete high levels of active erythropoietin 
(Powell, et al., 1986, Proc . Natl. Acad. Sci. 83:6465; 

30 Masatsuga, et al., European Patent Application Publication 
No. 0 236 059) . In PCT Application WO 88/00241 Powell, 
describes the preparation of mammalian cell lines (COS-7 
and BHK) transfected with the Apa I restriction fragment 
of the human EPO gene and selected for high expression by 
35 amplification. 

Human EPO cDNA has also been expressed in mammalian 
cells (Yanagi, et al . , 1989, DNA 8: 419). Berstein, in 
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PCT application WO 86/03520 describes the expression of 
EPO cDNA in various host cells, resulting in the secretion 
of up to 160 ng/ml of EPO into the medium after 
amplif ication. European Patent Application publication 
5 No . 0 267 678 discloses expression of recombinant EPO and 
secretion into the culture medium at levels of 600 
units /ml . 

A few scaffold attachment region (SAR) elements have 
been shown to increase the expression of reporter genes in 

10 transfected cells. SAR elements are thought to be DNA 
sequences which mediate attachment of chromatin loops to 
the nuclear matrix or scaffold. SAR elements are also 
known as MAR (matrix-associated regions) (reviewed by Phi- 
Van and Stratling, Prog. Mol . Subcell. Biol. 11:1-11, 

15 1990) . These elements will hereinafter be referred to as 
"SAR elements". SAR elements are usually 300 or more base 
pairs long, and they require a redundancy of sequence 
information and contain multiple sites of protein-DNA 
interaction. SAR elements are found in non-coding 

20 regions: in flanking regions or introns . 

Stief, et al., (Nature 341:343-345, 1989) stably 
transfected chicken macrophage cells by constructs which 
contained the CAT gene either fused to the lysozyme 
promoter, or to the lysozyme promoter and the lysozyme 
,.25 enhancer. When the transcription units contained in both 
constructs were flanked on both sides by lysozyme 5' SAR 
elements (A elements) , gene expression was increased about 
10 times relative to transf ectants , which contained the 
constructs lacking the SAR elements. 

30 Phi-Van, et al., (Mol. Cell. Biol., 10:2302-23 07, 

1990) determined the influence of the SAR element located 
5' to the chicken lysozyme gene (A element) on the CAT 
gene expression from a heterologous promoter (herpes 
simplex virus thymidine kinase promoter) in stably 

35 transfected heterologous cells (rat fibroblasts) . The 
median CAT activity per copy number in transf ectants was 
10 times higher for the transcriptional unit flanked on 
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both sides by A elements than for the transcriptional unit 
lacking SAR elements . 

Klehr, et al . , (Biochemistry, 30:1264-1270, 1991) 
stably transfected mouse L cells by different constructs 
5 containing the human interferon p gene. When the 
construct was flanked by SAR elements, the gene 
transcription level was enhanced 20-3 0 fold with respect 
to the SAR- free construct, containing only the immediate 
regulatory elements . 

10 However, the above-noted experiments have been 

limited to a very few examples of SAR elements, 
expressing mostly reporter genes, such as chloramphenicol 
acetyl transferase (CAT) or lucif erase. SAR elements have 
not shown consistent results in their effect on the 

15 expression of target genes and some target gene sequences 
have been found to inhibit the effect of SAR elements 
(Klehr, et al . , 1991, Biochemistry 30:1264). 
SUMMARY OF THE INVENTION 

The present inventor has significantly found that 

20 SAR elements may be used to increase the expression of 
recombinant mammalian EPO DNA. The present inventor 
constructed expression vectors carrying EPO genomic or 
cDNA sequences flanked by 3' and 5' human apolipoprotein 
B SAR elements. The expression vectors, when transfected 

25 into host cells resulted in increased expression of EPO 
compared to control host cells transfected with EPO 
expression vectors lacking SAR elements. Host cells 
transfected with expression vectors carrying an EPO cDNA 
sequence flanked by 3' and 5' SAR elements which expressed 

30 high levels of EPO were selected and cloned to obtain 
homogenous stable cell lines over-expressing EPO. Cloning 
produced stable cell lines expressing high levels of EPO, 
without the need for amplification. SAR elements have 
not, to the inventor's knowledge, heretofore been used for 

35 long term expression of a target gene in stable cell 
lines, or for the expression of recombinant EPO. 

The present invention thus provides a recombinant 
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DNA molecule adapted for transfection of a host cell 
comprising a nucleic acid molecule encoding mammalian 
erythropoietin, an expression control sequence operatively 
linked thereto and at least one SAR element. In an 
5 embodiment, the nucleic acid molecule encodes mammalian 
erythropoietin having the amino acid sequence shown in SEQ 
ID NOS 3 3 or 34 and Figure 3B. In another embodiment, the 
nucleic acid molecule has the sequence as shown in SEQ ID 
NO. 3 3 or 35. 

10 The SAR element is preferably a SAR element co- 

mapping with the chromatin domain boundary, such as the 
human apolipoprotein SAR elements, most preferably the 
SAR element comprises the sequence as shown in SEQ ID NO. 
36 or 37 and Figures 5 or 6 . 

15 The present invention also provides an expression 

vector comprising a recombinant DNA molecule adapted for 
transfection of a host cell comprising a nucleic acid 
molecule encoding mammalian erythropoietin, an expression 
control sequence operatively linked thereto and at least 

2 0 one SAR element. 

In an embodiment, the expression vector comprises a 
nucleic acid molecule encoding . erythropoietin and having 
the sequence shown in SEQ ID NO. 3 5 and Figure 4 under the 
control of the human cytomegalovirus IE enhancer and 

25 promoter and the beta-globin intron and, flanked by 5' and 
3' apolipoprotein SAR elements. 

In a further embodiment, the expression vector 
comprises a nucleic acid molecule encoding erythropoietin 
and having the sequence shown in SEQ ID NO. 3 5 and Figure 

30 4, flanked by 5' and 3' apolipoprotein SAR elements under 
the control of elongation factor - 1 alpha promoter and 
intron. 

The present invention still further provides a 
mammalian cell stably transfected with the expression 

3 5 vector of the invention. The mammalian cell may be any 

mammalian cell, for example CHO-Kl, BHK, Namalwa. An 
aspect of the invention provides a mammalian cell, lacking 
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multiple copies of an amplified selectable marker gene and 
capable of expressing recombinant EPO in vitro at levels 
of at least 1,500, preferably over 2,000, most preferably 
from 2,000 to 10,000 u/10 6 cells in 24 hours. 
5 In yet a further aspect, the invention provides a 

method of expressing recombinant mammalian erythropoietin 
comprising the steps of culturing a transfected mammalian 
cell of the invention in a suitable medium until 
sufficient amounts of erythropoietin are produced by the 
10 cell and separating the erythropoietin produced. 

The present invention also relates to a method of 
preparing recombinant erythropoietin comprising 
transfecting a mammalian cell with an expression vector 
comprising a nucleic acid molecule encoding mammalian 
15 erythropoietin, an expression control sequence operatively 
linked thereto and at least one SAR element; culturing the 
transfected cell in a suitable medium until sufficient 
amounts of erythropoietin are produced by the cell and 
separating the erythropoietin produced. In a preferred 
20 embodiment, the erythropoietin is produced at levels of at 
least 2,000, most preferably form 2,000 to 10,000 u/10 6 
cells in 24 hours in the absence of gene amplification. 
The invention also relates to erythropoietin produced by 
the method of the invention. 
25 Iri an embodiment of the method, the mammalian cell 

is further transfected with a selectable marker gene and 
transfected cells are selected in conditions where the 
activity of the product encoded by the selectable marker 
gene is necessary for survival of the cells. In a 
3 0 preferred embodiment, the selectable marker gene is 
pSV2neo . 

In a further embodiment, the method comprises the 
additional step of identifying and selecting cells 
producing high levels of erythropoietin; cloning the 
3 5 selected cells; establishing long term cell lines from the 
selected cells and; culturing the selected cells in a 
suitable medium until sufficient amounts of erythropoietin 
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are produced by the cell and separating the erythropoietin 
produced. 

The present invention further provides a transgenic 
non-human animal or embryo whose germ cells and somatic 
5 cells contain a DNA construct comprising the recombinant 
DNA molecule of the invention. 
DESCRIPTION OF THE DRAWINGS 

The invention will be better understood with 
reference to the drawings in which: 
10 Figure 1A is a schematic representation of the 

procedure for synthesizing the erythropoietin gene; 

Figure IB shows the DNA sequences of the 
oligonucleotides synthesized to construct the 
erythropoietin gener- 
is Figure 2A is a schematic representation of the 
procedure for synthesizing the EP0 8hort sequence; 

Figure 2B is a schematic representation of the 
procedure for synthesizing the EP0 long sequencer- 
Figure 3A shows the DNA sequence of the 
20 erythropoietin gene; 

Figure 3B shows the amino acid sequence of the 
mature erythropoietin protein; 

Figure 4 shows the DNA sequence of the EP0 long 
sequence; 

25 Figure 5 shows the DNA sequence of the 3' SAR 

element of human apo lipoprotein B; 

Figure 6 shows the DNA sequence of RhlO; 

Figure 7 shows a restriction map of the vector pLW18 

and; 

3 0 Figure 8 shows a restriction map of the vector 

PLW19 . 

DETAILED DESCRIPTION OF THE INVENTION 

As hereinbefore noted, the present inventor has 
significantly found that recombinant EPO expression is 
3 5 increased in host cells transfected with recombinant DNA 
encoding EPO operatively linked to an expression control 
sequence and SAR elements, compared to host cells 
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transfected with EPO in the absence of SAR elements. 

The present invention thus provides a recombinant 
DNA molecule adapted for transfection of a host cell 
comprising a nucleic acid molecule encoding mammalian 
5 erythropoietin, an expression control sequence operatively 
linked thereto and at least one SAR element . In a 
preferred embodiment the EPO is human EPO and in a 
particularly preferred embodiment the nucleic acid 
molecule has a sequence which encodes erythropoietin 
10 having the amino acid sequence as shown in SEQ ID NO 3 3 
and 34 and Figure 3B. In a particular embodiment, the 
nucleic acid molecule has the sequence as shown in SEQ ID 
NOS. 35 or 33. 

1. Nucleic Acid Molecules Encoding ErYt-h ropoietin 

15 The term "a nucleic acid molecule encoding mammalian 

EPO" as used herein means any nucleic acid molecule which 
encodes biologically active EPO. It will be appreciated 
that, within the context of the present invention, EPO 
may include various structural forms of the primary 

20 protein which retain biological activity. Biologically 
active EPO will include analogues of EPO having altered 
activity, for example having greater biological activity 
than EPO. Biological activity of EPO may be readily 
determined by the methods referred to herein. 

2 5 Nucleic acid molecules encoding EPO include any 

sequence of nucleic acids which encode biologically active 
EPO, preferably, the nucleic acid molecule also encodes 
the leader sequence of the prepeptide to permit secretion 
of EPO from a cell transfected with the recombinant DNA 

3 0 molecule of the invention. The amino acid sequence of the 

leader sequence of the prepeptide is shown in SEQ ID NO. 
33, from amino acid number -27 to -1. in an embodiment, 
the nucleic acid molecule encodes a peptide having the 
amino acid sequence as shown in SEQ ID NO. 33 and 3 4 and 
35 Figure 3B. Nucleic acid molecules encoding EPO include 
the entire EPO gene sequence as shown in SEQ ID NO. 33, 
one or more fragments of this sequence encoding the EPO 
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prepeptide (nucleotides 625-637, 12 01-1346, 1605-1691, 
2303-2482 and 2617-2772 in SEQ ID NO. 33) or the mature 
EPO peptide (nucleotides 1269-1346, 1605-1691, 2303-2482 
and 2617-2769 in SEQ ID NO. 33) or nucleic acid molecules 
5 having substantial homology thereto, or any fragment 
thereof encoding biologically active EPO, such as the 
EP0 long sequence shown in SEQ ID NO. 3 5 and Figure 4. 

It will be appreciated that the invention includes 
nucleotide or amino acid sequences which have substantial 
10 sequence homology with the nucleotide and amino acid 
sequences shown in SEQ ID NOS : 33, 34 and 3 5 and Figures 
3A, 3B and 4. The term "sequences having substantial 
sequence homology" means those nucleotide or amino acid 
sequences which have slight or inconsequential sequence 
15 variations from the sequences disclosed in SEQ ID NOS. 33, 
34 and 35 i.e. the homologous sequences function in 
substantially the same manner to produce substantially the 
same polypeptides as the actual sequences. Due to code 
degeneracy, for example, there may be considerable 
20 variation in nucleotide sequences encoding the same amino 
acid sequence. The variations may be attributable to 
local mutations or structural modifications. 

It will also be appreciated that a double stranded 
nucleotide sequence comprising a DNA segment of the 

2 5 invention or an oligonucleotide fragment thereof, hydrogen 

bonded to a complementary nucleotide base sequence, and an 
RNA made by transcription of this double stranded 
nucleotide sequence, are contemplated within the scope of 
the invention. 

30 A number of unique restriction sequences for 

restriction enzymes are incorporated in the DNA sequence 
identified in SEQ ID NO: 33, and 3 5 and in Figures 3A, 
and 4 respectively, and these provide access to 

nucleotide sequences which code for polypeptides unique to 

3 5 EPO. DNA sequences unique to EPO or isoforms thereof, can 

also be constructed by chemical synthesis and enzymatic 
ligation reactions carried out by procedures known in the 
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art . 

Mutations may be introduced at particular loci for 
instance by synthesizing oligonucleotides containing a 
mutant sequence, flanked by restriction sites enabling 
ligation to fragments of the native sequence. Following 
ligation, the resulting reconstructed sequence encodes a 
derivative having the desired amino acid insertion, 
substitution, or deletion. 

Alternatively, oligonucleotide-directed site- 
specific mutagenesis procedures may be employed to provide 
an altered gene having particular codons altered according 
to the substitution, deletion, or insertion required. 
Deletion or truncation derivatives of EPO may also be 
constructed by utilizing convenient restriction 
15 endonuclease sites adjacent to the desired deletion. 
Subsequent to restriction, overhangs may be filled in or 
removed, and the DNA religated. Exemplary methods of 
making the alterations set forth above are disclosed by 
Sambrook et al . (Molecular cloning A Laboratory Manual, 2d 
20 Ed., Cold Spring Harbor Laboratory Press, 1989). 

The scope of the present invention also includes 
conjugates of EPO along with other molecules such as 
proteins or polypeptides. This may be accomplished, for 
example, by the synthesis of N-terminal or C-terminal 
25 fusion proteins or fragments of proteins to facilitate 
purification or identification of EPO (see U.S. Patent No. 
4,851,341, see also, Hopp et al . , Bio/Technology 6:1204, 
1988.) Thus, fusion proteins may be prepared by fusing 
through recombinant techniques the N-terminal or C- 
3 0 terminal of EPO or other portions thereof, and the 
sequence of a selected protein with a desired biological 
function. The resultant fusion proteins contain EPO or a 
portion thereof fused to the selected protein or portion 
thereof. Examples of proteins which may be selected to 
3 5 prepare fusion proteins include lymphokines such as gamma 
interferon, tumor necrosis factor, IL-1, IL-2.IL-3, IL-4, 
IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, GM-CSF, CSF-1 
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and G-CSF, nerve growth factor, protein A, protein G, GST 
and the Fc portion of immunoglobulin molecules. 

Nucleic acid molecules encoding EPO may be 
chemically synthesized or may be cloned from a genomic or 
5 cDNA mammalian library using oligonucleotide probes 
derived from the known EPO sequences following standard 
procedures. In this manner, nucleic acid molecules 
encoding EPO may be obtained from the cells of a selected 
mammal . 

10 For example, cDNA sequences encoding mammalian EPO 

may be isolated by constructing cDNA libraries derived 
from reverse transcription of mRNA in cells from the 
selected mammal which express EPO, for example adult 
kidney cells or foetal liver cells. Increased levels of 
15 expression in the cell may also be achieved, for example 
by inducing anemia in the mammal. DNA oligonucleotide 
probes may be used to screen the library for positive 
clones. Genomic DNA libraries may also be constructed and 
screened by plaque hybridization using fragments of EPO 
20 cDNA as probes. 

Nucleic acid molecules which encode EPO may also be 
obtained from a variety of sources, including for example, 
depositories which contain plasmids encoding EPO sequences 
including the American Type Culture Collection (ATCC, 
25 Rockville Maryland) , and the British Biotechnology Limited 
(Cowley, Oxford England) . EPO DNA as described in Lin, 
(1985, Proc. Natl. Acad. Sci. U.S.A. 82:7580) is deposited 
as HUMERPA, Accession No M11319. 

Various post translational modifications are 
3 0 contemplated to the EPO encoded by the nucleic acid 
molecule. For example, EPO may be in the form of acidic or 
basic salts, or in neutral form. In addition, individual 
amino acid residues may be modified by, for example, 
oxidation or reduction. Furthermore, various 
3 5 substitutions, deletions, or additions may be made to the 
amino acid or DNA nucleic acid sequences, the net effect 
of which is to retain biological activity of EPO. 
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In a preferred embodiment of the invention, the 
nucleic acid molecule of the invention comprises the EP0 long 
nucleotide molecule encoding EPO, shown in SEQ ID NO. 3 5 
and Figure 4. EP0 lonff may be chemically synthesized by 
5 assembling short nucleotides, for example, as shown in 
Figure IB and SEQ ID NOS . 1 to 32. The procedure for the 
synthesis and assembly of the EPO lGng sequence from the 
short nucleotides is shown schematically in Figure 2B. 
2. SAR Elements 

10 The term "SAR elements" as used herein refers to DNA 

sequences having an affinity or intrinsic binding ability 
for the nuclear scaffold or matrix. SAR elements are 
usually 300 or more base pairs long, require a redundancy 
of sequence information and contain multiple sites of 

15 protein-DNA interaction. Preferably SAR elements are 
found in non coding regions: in flanking regions or 
introns . 

Suitable SAR elements for use in the invention are 
those SAR elements which promote elevated and position- 

20 independent gene activity in stable transf ectants . SAR 
elements may be obtained, for example, from eukaryotes 
including mammals, plants, insects and yeast, preferably 
mammals . SAR elements are preferably selected which co-map 
with the boundaries of a chromatin domain. SAR elements 

25 co-mapping with the chromatin domain boundary are 
preferred for the recombinant DNA molecules of the 
invention to promote the formation of an independent 
domain containing the nucleic acid molecule encoding 
mammalian EPO to be expressed in stable transf ectants . 

3 0 Examples of preferred SAR elements which co-map with 

the chromatin domain boundary include the following: the 
5' human apoB SAR element (RhlO), a Xbal fragment spanning 
nucleotides -5,262 to -2,73 5 of the human apoB gene, as 
shown in SEQ ID 3 7 and Figure 6 and as described in Levy- 

35 Wilson and Fortier, 1989, (J. Biol. Chem. 264:21196). 
This region actually contains two SAR elements, a proximal 
and distal one. It is contemplated that either the 
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proximal and distal SAR elements together or the distal 
element alone may be used. The 3' human apoB SAR element 
extends between nucleotides +43,186 and +43,850. The DNA 
sequence of this 665 bp region is described in Levy-Wilson 
5 and For tier, 1989, supra and shown in SEQ ID NO. 36 and 
Figure 5. A 60-bp deleted 3 'human apoB SAR element 
(Rh32), is also suitable for use in the invention. The 
deletion is shown within brackets in Figure 5. 

Examples of suitable protocols for identifying SAR 
10 elements for use in the present invention are described 
below. The high salt method may be used to identify SAR 
elements by measuring the ability of labelled naked DNA 
fragments to bind nuclear matrices in the presence of 
unlabelled competitor DNA, typically E. coli DNA. DNA 
15 fragments bound to nuclear matrices under these conditions 
are operationally defined as SAR elements. The nuclear 
matrices may be isolated by the 2 M NaCl extraction of 
DNAase I-digested nuclei (Cockerill and Garrard, 1986, 
Cell 44, 273-282) . Chromosomal loop anchorage of the 
20 kappa immunoglobulin gene occurs next to the enhancer in 
a region containing topoisomerase II sites. (Cockerill, 
1990, Nucleic. Acids. Res. IS, 2643-2648). 

In the low salt method for identifying SAR elements 
nuclei are heated to 37°C and then extracted with a buffer 
25 containing 25 mM 3 , 5-diiodosalicylic acid lithium salt 
(LIS) which removes histones. The LIS-treated nuclei are 
extracted several times with a low salt buffer. The 
extracted nuclei are digested with restriction 
endonucleases . The solubilized (non-matrix bound) DNA 
3 0 fragments are removed by centrif ugation . Southern blot 
hybridization with labelled probes identifies the DNA 
fragments bound to the nuclear matrix (Mirkovitch et al . , 
1984, Cell 39, 223-232) . Both the above-noted methods 
yield essentially the same result, that is, a sequence 
35 identified as a SAR element with the high salt method will 
also be identified as a SAR element by the Lis-low salt 
method. 
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The SAR element may be inserted into the recombinant 
molecule of the invention upstream or downstream from the 
nucleic acid molecule encoding EPO and the operatively 
linked expression control sequence. Preferably, the SAR 
5 element is inserted within 0.1 to 100 kb upstream or 
downstream, more preferably from 0.1 to 50 kb, most 
preferably 0.5 to 10. In a preferred embodiment, more than 
one SAR element should be inserted into the recombinant 
molecule of the invention, preferably the SAR elements 

10 should be located in flanking positions both upstream and 
downstream from the nucleic acid molecule encoding EPO 
and the operatively linked expression control sequence. 
The use of flanking SAR elements in the nucleic acid 
molecules may allow the SAR elements to form an 

15 independent loop or chromatin domain, which is insulated 
from the effects of neighbouring chromatin. Accordingly, 
EPO gene expression may be position- independent and the 
level of expression should be directly proportional to the 
number of integrated copies of the recombinant DNA 

20 molecules of the invention. Preferably, the SAR elements 
should be inserted in non-coding regions of the 
recombinant DNA molecule. 

The recombinant DNA molecules of the invention may 
be advantageously used to express elevated levels of 

2 5 mammalian EPO. Routine procedures may be employed to 

confirm that the SAR elements selected by the above-noted 
protocols are useful for expressing elevated levels of 
mammalian EPO. For example, appropriate expression 
vectors comprising a nucleic acid molecule encoding 
30 mammalian erythropoietin and an expression control 
sequence operatively linked thereto may be constructed 
with and without (control vectors) SAR elements. 
Mammalian host cells may be stably transfected with an 
EPO expression vector having a selectable marker gene or 

3 5 may be stably co- transfected with an EPO expression vector 

and a selectable marker gene vector, such as a pSV2-neo 
vector (which carries the gene conferring the resistance 
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to the antibiotic G-418) . The levels of EPO secreted may 
be determined, for example, by RIA and the effect of the 
flanking SAR elements on EPO production may be determined. 
Transfected cell populations producing the highest levels 
5 of EPO as detected by RIA, may be selected and subjected 
to successive rounds of cloning by the dilution method. 
3. Expression Control Sequences 

Suitable expression control sequences may be derived 
from a variety of mammalian sources. Selection of 
10 appropriate regulatory elements is dependent on the host 
cell chosen, and may be readily accomplished by one of 
ordinary skill in the art. Examples of regulatory elements 
include: a transcriptional promoter and enhancer or RNA 
polymerase binding sequence, splice signals, 

15 polyadenylation signals, including a translation 
initiation signal. Additionally, depending on the host 
cell chosen and the vector employed, other genetic 
elements, such as an origin of replication, additional DNA 
restriction sites, enhancers, sequences conferring 
20 inducibility of transcription, and selectable markers, may 
be incorporated into the expression vector. 

Strong promoters (or enhancer /promoters ) are 
preferably selected. A strong promoter is one which will 
direct the transcription of a gene whose product is 
25 abundant in the cell. The relative strength and 
specificities of a promoter /enhancer may be compared in 
comparative transient transfection assays. In an 

embodiment, a promoter may be selected which has little 
cell-type or species preference and which can therefore be 
30 strong when transfected into a variety of cell types. 

In preferred embodiments of the invention, the EF1 
promoter or the human cytomegalovirus (hCMV) IE 
(immediate-early) enhancer and promoter may be used. The 
human EF-la gene promoter is stronger than the adenovirus 
35 major late promoter in a cell free system (Uetsuki et al . , 
1989, J. Biol. Chem. 264:5791). The EF1 promoter is the 
promoter for the human chromosomal gene for the 
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polypeptide chain elongation factor-la: extending from - 
3 03 to -1 for genomic constructs and from -303 to +986 for 
cDNA constructs. For the cDNA constructs, the EF1 DNA 
sequence includes the promoter as well as exon 1 and 
5 intron 1 (Uetsuki et al, 1989, J . Biol. Chem. 264, 5791- 
5798) . The human cytomegalovirus (hCMV) IE (immediate- 
early) enhancer and promoter, extend from -598 to +54 in 
the sequence (Kay and Humphries, 1991, Methods in 
Molecular and Cellular Biology 2, 254-265) . 
10 4, Methods of Expressing ggO 

As hereinbefore noted, the present invention also 
provides expression vectors which include the recombinant 
DNA molecule of the invention and mammalian cells stably 
transfected with the expression vector. 
15 Suitable expression vectors, such as plasmids, 

bacteriophage, retroviruses and cosmids are known in the 
art. Many plasmids suitable for transfecting host cells 
are well known in the art, including among others, pBR3 22 
(see Bolivar et al . , Gene 2:9S, 1977), the pUC plasmids 
20 pUC18, pUC19, pUC118, pUC119 (see Messing, Meth in 
Enzymology 101:2 0-77, 1983 and Vieira and Messing, Gene 
19:259-268, 1982), and pNH8A, pNH16a, pNH18a, and 
Bluescript M13 (Stratagene, La Jolla, Calif.). Retroviral 
vectors are reviewed in Eglitis and Anderson, 1988, 
25 Biotechniques 6:608. Suitable expression vectors are 
those which are stably incorporated into the chromosome of 
the mammalian host cell. 

The recombinant DNA molecule of the invention may be 
expressed by a wide variety of mammalian cells. Methods 
30 for transfecting such cells to express foreign DNA are 
well known in the art (see, e.g., Itakura et al . , U.S. 
Patent No. 4,704,3 62; Hinnen et al . , PNAS USA 75:1929- 
1933, 1978; Murray et al . , U.S. Patent No. 4,801,542; 
Upshall et al . , U.S. Patent No. 4,93 5,3 49; Hagen et al . , 
35 U.S. Patent No. 4,784,950; Axel et al . , U.S. Patent No. 
4,399,216; Goeddel et al . , U.S. Patent No. 4,766,075; and 
Sambrook et al . Molecular Cloning A Laboratory Manual, 2nd 
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edition, Cold Spring Harbor Laboratory Press, 1989, all of 
which are incorporated herein by reference) . Suitable 
expression vectors include vectors having a selectable 
marker gene. 

5 Mammalian cells suitable for carrying out the 

present invention include, among others: COS (e.g., ATCC 
No. CRL 1650 or 1651), BHK (e.g., ATCC No. CRL 6281), CHO- 
Kl (ATCC No. CCL 61), HeLa (e.g., ATCC No. CCL 2), 293 
(ATCC No. 1573) and NS-1 cells. As noted above, suitable 
10 expression vectors for directing expression in mammalian 
cells generally include a promoter, as well as other 
transcriptional and translational control sequences. 
Common promoters include SV40, MMTV, metallothionein-1 , 
adenovirus Ela, CMV, immediate early, immunoglobulin heavy 
15 chain promoter and enhancer, human cytomegalovirus IE 
enhancer and promoter and RSV-LTR . Protocols for the 
transfection of mammalian cells are well known to those of 
ordinary skill in the art. Representative methods include 
calcium phosphate mediated gene transfer, electroporation , 
20 retroviral, and protoplast fusion-mediated transfection 
(see Sambrook et al . , supra). 

Given the teachings provided herein, promoters, 
terminators, and methods for introducing expression 
vectors of an appropriate type into mammalian cells may be 
25 readily accomplished. Accordingly, the invention also 
relates to mammalian cells stably transfected with an 
expression vector of the invention. The term "stably 
transfected" refers to the fact that suitable expression 
vectors are those which stably incorporate the recombinant 
3 0 DNA molecule of the invention into the chromosomes of the 
mammalian cell. 

The invention also relates to a mammalian cell 
lacking a selectable marker gene capable of being 
amplified and, capable of expressing recombinant EPO in 
35 vitro at levels of at least 1,500, preferably at least 
2,000, most preferably 2,000 to 10,000 u/10 6 cells in 24 
hours . 
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In an embodiment, the invention also relates to a 
mammalian cell, having less than 1,000, preferably less 
than 100, most preferably less than 10 copies of a 
selectable marker gene and capable of expressing 
5 recombinant EPO in vitro at levels of at least 1,500, 
preferably at least 2,000, most preferably 2,000 to 10,000 
u/10 6 cells in 24 hours. 

Amplification refers, for example, to the process of 
culturing cells in appropriate medium conditions to 
10 select cells resistant to the drug methotrexate. Such 
cells have been found to be resistant to methotrexate due 
to an amplification of the number of their gene encoding 
dihydrofolate reductase (DHFR) . Selection for cells 
resistant to methotrexate produces cells containing 
15 greater numbers of DHFR genes. Passenger genes, such as 
the EPO gene carried on the expression vector along with 
the DHFR gene or co- trans fee ted with the DHFR gene may 
also be increased in their gene copy number. Cells which 
have been amplified thus have multiple copies of the 
20 selectable marker gene in addition to the passenger gene. 

It is an advantage of the present invention that 
high levels of EPO expression may be achieved without the 
need for amplification. Thus the mammalian cells of the 
invention express high levels of EPO and do not express 
25 high levels of a selectable marker gene. The expression 
of high levels of selectable marker genes in amplified 
cells may interfere with the cell's ability to remain 
stable in long term culture and to express high levels of 
EPO in long term culture. Amplified cells may carry 
30 certain disadvantages such as non-specific toxicity 
associated with exposure of the cells to the inhibitory 
drug or compound. 

Selectable marker genes which may be amplified in a 
mammalian host cell are known in the art and include the 
3 5 genes encoding proteins conferring resistance to 
chloramphenicol (Wood et al . , CSHSQB 51:1027, 1986), 
methotrexate (Miller, MC Biol., 5:431, 1985; Corey et al . , 
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Blood 75:337, 1990; Williams et al . , Proc. Natl. Acad. 
Sci. USA, 83:2566, 1986; Stead et al . , Blood 71:742, 
1988), mycophenolic acid (Stuhlmann et al . , Proc. Natl. 
Acad. Sci. USA 81:7151, 1984), or various chemotherapeutic 
5 agents (Guild et al . , Proc. Natl. Acad. Sci USA 85:1595, 
1988; Kane et al . , Gene 84:439, 1989; Choi et al . , Proc. 
Natl. Acad. Sci. USA; Sorrentino, et al . , Science 257:99, 
1992) . A discussion of selectable marker genes, including 
those capable of being amplified is provided in Kreigler, 
10 1990, "Gene Transfer and Expression, A Laboratory Manual" , 
Chapter 6, Stockton Press, and include the gene encoding 
dihydrof olate reductase. 

Within another aspect, the present invention relates 
to a method of preparing recombinant erythropoietin 
15 comprising transfecting a mammalian cell with an 
expression vector comprising a nucleic acid molecule 
encoding mammalian erythropoietin, an expression control 
sequence operatively linked thereto and at least one SAR 
element; culturing the transfected cell in a suitable 
20 medium until sufficient amounts of erythropoietin are 
produced by the cell and separating the erythropoietin 
produced. In an embodiment, the mammalian cell may be 
further transfected with a selectable marker gene and the 
transfected cells may be selected by means of the 
25 selectable marker gene. Examples of selectable marker 
genes are given above and include neo. 

In an embodiment of the method, cells producing high 
levels of erythropoietin may be identified and selected 
and subcloned to establish long term cell lines from the 
30 selected cells and; the selected cells may be cultured in 
a suitable medium until sufficient amounts of 
erythropoietin are produced by the cell. EPO produced may 
then be separated from the medium. 

It is an advantage of the method of the invention 
3 5 that cell lines may be established which are stable over 
the long term, at least over six months. The long term 
cell lines of the invention express consistently high 
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levels of EPO and may be maintained without the selective 
pressure often required to maintain the high copy number 
of amplified genes in cell lines which have been subjected 
to amplif ication. 
5 EPO may be prepared by culturing the host /vector 

systems described above, in order to express the EPO. 
Recombinantly produced EPO may be further separated and 
further purified as described in more detail below. 

Biologically active EPO expressed may be assayed by 
10 known procedures such as tritiated thymidine uptake by 
mouse spleen cell erythrocyte precursors (Krystal, et al . , 
1986, Blood 67:71); the exhypoxic mouse method using 59 Fe 
incorporation into erythrocyte precursors (Cotes and 
Bangham, 1961, Nature 191:1065); 59 Fe uptake into fetal 
15 mouse liver cells (Dunn et al, 1975, Exp. Hematol . 3:65); 
and the starved rat method (Goldwasser and Gross, 1975, 
Methods Enzymol. 37:109). EPO once expressed may also be 
quantitated, for example by RIA, separated and purified by 
known techniques such as ultrafiltration, flat-bed 
20 electrof ocusing, gel filtration, electrophoresis, 
isotachophoresis and various forms of chromatography, such 
as ion exchange, adsorption chromatography, column 
electrophoresis and various forms of HPLC . Procedures for 
the chromatographic separation of EPO are described, for 
25 example in U.S. Patent No 4,667,016. 

The present invention also relates to transgenic 
non-human mammals or embryos whose germ cells and somatic 
cells contain a DNA construct comprising the recombinant 
DNA molecule of the invention. The recombinant DNA 
3 0 molecule of the invention may be expressed in non-human 
transgenic animals such as mice, rats, rabbits, sheep, 
cows and pigs (see Hammer et al . (Nature 315:680-683, 
1985), Palmiter et al. (Science 222:809-814, 1983), 
Brinster et al . (Proc Natl. Acad. Sci USA 82:44384442, 
35 1985), Palmiter and Brinster (Cell. 41:343-345, 1985) and 
U.S. Patent No. 4,736,866). Briefly, an expression unit, 
including a DNA sequence to be expressed together with 
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appropriately positioned expression control sequences, is 
introduced into pronuclei of fertilized eggs. 
Introduction of DNA is commonly done by microinjection. 
Integration of the injected DNA is detected by blot 
5 analysis of DNA from tissue samples, typically samples of 
tail tissue. It is preferred that the introduced DNA be 
incorporated into the germ line of the animal so that it 
is passed on to the animal's progeny. Tissue-specific 
expression may be achieved through the use of a tissue- 
10 specific promoter, or through the use of an inducible 
promoter, such as the metallothionein gene promoter 
(Palmiter et al . , 1983, ibid), which allows regulated 
expression of the transgene. Alternatively, yeast 
artificial chromosomes (YACs) may be utilized to introduce 
15 DNA into embryo -de rived stem cells by fusion with yeast 
spheroblasts carrying the YAC (see Capecchi, Nature 
362:255-258, 1993; Jakobovits et al . , Nature 362:255-258, 
1993). Utilizing such methods, animals may be developed 
which express EPO in tissues. Tissue specific promoters 
20 may be used to target expression of EPO in cells. Tissue 
specific promoters include the 5' or 3' flanking sequences 
of the beta-globin, elastase, alpha-f etoprotein, alpha-A 
crystalline, an erythroid specific transcriptional element 
and insulin genes (Yee, et al . (1989) P.N.A.S., U.S.A. 
25 86, 5873-5877; Swift, et al . 1984, Cell 38:639; Storb et 
al., Nature (Lond.) 310:238; Grosscheldl et al . , 1985 Cell 
41:885; Shani , 1985 Nature (Lond) 314:238 and Chada et al , 
1985, Nature (Lond)). The use of SAR elements in the 
development of transgenic animals is described for example 
30 in Xu, 1989, J. Biol. Chem. 264:21190; McKnight et al., 
1992, Proc. Natl. Acad. Sci. U.S.A. 89:6943; Brooks, et 
al . , 1994, Mol. Cell. Biol. 14:2243 and; Forrester, et 
al., 1994, Science 265:1221. 

In a preferred embodiment suitable promoters and/or 
3 5 enhancers may be selected from mammary gland specific 
genes which are normally only expressed in milk, for 
example the genes encoding a-casein (Gene Pharming , 
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Leiden, Netherlands), |J-casein (Genzyme Transgenics Corp. 
Framingham, Mass.), ycasein, K-casein a-lactablbumin (3- 
lactalbumin 0-lactogloblin (PPL Therapeutics Ltd, 
Edinburgh, Scotland) and whey acidic protein (Altra Bio 
5 Inc., Arden Hills MN) . Methods for targeting recombinant 
gene expression to the mammary gland of a mammal are 
described, for example, in U.S. Patent No. 5,304,489. 
Briefly, a DNA construct with the recombinant DNA molecule 
of the invention comprising a mammary gland specific 

10 promoter may be microinjected into a newly fertilized egg 
leading to integration of the construct into the genome 
and secretion of the protein into the milk of a mature 
lactating female. EPO expressed in the milk may be 
purified from the milk using standard procedures after 

15 skimming off the milk fat, separating out the caseins and 
precipitating EPO out of the whey fraction, followed by 
standard protein purification procedures for EPO as 
described elsewhere herein. 

A major problem in the generation of transgenic 

20 mammals for the expression of recombinant proteins has 
been the varying levels of expression which result due to 
chromosomal factors in the local environment where the 
construct integrates, for example regulatory elements and 
the state of the chromatin (open or closed) . As a result, 

2 5 generating a transgenic mammal that produces high levels 

of ^ recombinant protein has been achieved only by 
laborious trial and error. 

The present invention provides a transgenic non- 
human animal whose germ cells and somatic cells contain a 

3 0 DNA construct comprising the recombinant DNA molecules of 

the invention, comprising a tissue specific promoter, 
preferably the tissue specific promoter is a promoter 
which specifically expresses EPO in milk. In a 

particularly preferred embodiment, the recombinant 
3 5 molecule contained in the transgenic non-human mammal 
comprises the nucleic acid molecule encoding EPO and the 
expression control sequence operatively linked thereto, 
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flanked by SAR elements. An advantage of DNA constructs 
having flanking SAR elements is that expression of EPO may 
be independent of the site of integration of the construct 
as the construct is insulated from surrounding chromosome 
5 material by the SAR elements, which define an open 
chromatin domain. 
5 . Applications 

It will be apparent that the recombinant DNA 
molecules, expression vectors, transfected host cells, 

10 methods and transgenic animals of the invention will be 
useful for the efficient expression and production of 
recombinant mammalian EPO in vitro and in vivo as 
described above. 

Recombinant EPO may be used to treat animals and 

15 human patients, including patients having anemia as a 
result of chronic renal failure. As EPO is the primary 
regulator of red blood cell formation, it has applications 
in both the diagnosis and treatment of disorders of red 
blood cell production and has potential applications for 

2 0 treating a range of conditions such as anemia, sickle cell 

disease, conditions where red cells are depleted (for 
example in bone marrow transplants) , thalassemia, cystic 
fibrosis, menstrual disorders, acute blood loss and 
conditions involving abnormal erythropoiesis (for example 

25 cancers of the haemopoietic system) , conditions involving 
destruction of red blood cells by over exposure to 
radiation, reduction in oxygen intake at high altitudes, 
complications or disorders secondary to AIDS and prolonged 
unconsciousness . 

30 The recombinant DNA molecules, expression vectors 

and transformed mammalian cells of the invention will also 
have useful applications in gene therapy, whereby a 
functional EPO gene is introduced into a mammal in need 
thereof, for example mammals having anemias. The transfer 

3 5 of the recombinant DNA molecule of the invention into 

mammalian cells may be used, for example in gene therapy 
to correct an inherited or acquired disorder through the 
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synthesis of missing or defective EPO gene products in 
vivo . 

The recombinant DNA molecule of the invention may be 
used in gene therapy as briefly described below. The 
5 recombinant DNA molecule may be introduced into cells of 
a mammal, for example haemopoietic stem cells removed from 
the bone marrow or blood of the mammal. Hemopoietic stem 
cells are particularly suited to somatic gene therapy as 
regenerative bone marrow cells may be readily isolated, 
10 modified by gene transfer and transplanted into an 
immunocompromised host to reconstitute the host's 
hemopoietic system. Suitable hemopoietic stem cells 
include primitive hemopoietic stem cells capable of 
initiating long term culture (Sutherland et al . , Blood, 
15 Vol. 74, p. 1563, 1986 and Udomsakdi et al., Exp. 
Hematol., Vol. 19, p. 338, 1991.) Suitable cells also 
include fibroblasts and hepatocytes . 

The recombinant DNA molecules of the invention may 
be introduced into the cells by known methods, including 
calcium phosphate mediated transfection described herein 
or retroviral mediated uptake. The recombinant DNA 
molecule of the invention may be directly introduced into 
cells or tissues in vivo using delivery vehicles such as 
retroviral vectors, adenoviral vectors and DNA virus 
25 vectors. They may also be introduced into cells in vivo 
using physical techniques such as microinjection and 
electroporation or chemical methods such as 
coprecipitation and incorporation of DNA into liposomes. 
Recombinant molecules may also be delivered in the form of 
3 0 an aerosol or by lavage. The recombinant DNA molecules of 
the invention may also be applied extracellularly such as 
by direct injection into cells. Freed et al . , New Eng. J. 
Med. 327(22) : 1549-1555, 1992, describe a method for 
injecting fetal cells into brains of Parkinson's patients. 
3 5 Gene therapy involving bone marrow transplant with 
recombinant primary hemopoietic stem cells requires 
efficient gene transfer into the stem cells. As a very 
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small number of primary stem cells can reconstitute the 
entire host hemopoietic system it is important that the 
transferred gene be efficiently expressed in the 
recombinant stem cells transferred. Thus it is expected 
5 that the recombinant molecules of the invention will be 
particularly advantageous for use in gene therapy to 
correct defects in the erythropoietin gene. 

As hereinbefore noted, the recombinant DNA molecules 
of the invention, having flanking SAR elements, are 
10 particularly useful for the expression of EPO in 
transgenic mammals and therefore they are particularly 
useful in gene therapy as the SAR elements define an open 
chromatin domain and insulate the construct from the 
surrounding chromosome material, thereby, providing 
15 position-independent expression. 

The following non-limiting examples are illustrative 
of the present invention: 

EXAMPLES 
EXAMPLE 1 

2 0 Synthesis of EPO cDNA Sequences 

EPO Oligonucleotide synthesis 

EPO oligonucleotides were synthesized with the 
Applied Biosystems Inc. 392 DNA/RNA Synthesizer at the 0.2 
|im scale (-500 \ig) . Oligonucleotide 5' ends were 

25 phosphorylated, except for the ones coinciding with the 5' 
ends of each of the four blocks. Oligonucleotides lacking 
5' phosphorylation were purified using the Applied 
Biosystems Oligo Purification Cartridge (OPC) . 
Oligonucleotides with 5' phosphorylation were purified 

30 from acrylamide gels (according to the protocol described 
in Sambrook, Fritsch, and Maniatis, 1989. Molecular 
Cloning: A Laboratory Manual, 2nd Edition. Cold Spring 
Harbor Laboratory Press, pp. 11.23 - 11.30). C 18 Sep-Pak 
cartridges from Millipore were used to remove salts. All 

3 5 oligos were resuspended in double distilled H 2 0. 

Oligonucleotides EPOl (SEQ ID NO. 1), EP02 (SEQ ID 
NO. 2), EP03 { SEQ ID NO . 3), EP04 (SEQ ID NO. 4), EP05 



BNSDOCID: <WO 9619573A1_I_> 



WO 96/19573 



PCT/CA95/00696 



(SEQ ID NO. 5), EP06 (SEQ ID NO. 6), EP07 (SEQ ID NO. 7), 
EP08 (SEQ ID NO. 8), EP09 (SEQ ID NO. 9), EPO10 (SEQ ID 
NO. 10), EPOll (SEQ ID NO. 11), EP012 (SEQ ID NO. 12), 
EP013 (SEQ ID NO. 13), EPO!3b (SEQ ID NO. 14), EP014 (SEQ 
5 ID NO. 15), EP015 (SEQ ID NO. 16), EPOlCt (SEQ ID NO. 17), 
EP02a (SEQ ID NO. 18), EP03a (SEQ ID NO. 19), EPO40C (SEQ 
ID NO. 20), EPOBa (SEQ ID NO. 21), EPOSa (SEQ ID NO. 22), 
EP07a (SEQ ID NO, 23), EP08a (SEQ ID NO. 24), EP09a (SEQ 
ID NO. 25), EPOlOa (SEQ ID NO. 26), EPOlla (SEQ ID NO. 
10 27), EP012a (SEQ ID NO. 28), EP013a (SEQ ID NO. 29), 
EP013BOC (SEQ ID NO. 3 0 EP0140C (SEQ ID NO . 31) and EPO150C 
(SEQ ID NO. 32) as shown in Figure IB were synthesized. 
All oligonucleotides labelled with "a" are for the 
complementary strand (i.e. negative sense of the EPO 
15 gene). Oligonucleotides EPOl (SEQ ID NO. 1), EPOla (SEQ 
ID NO. 17), EP015 (SEQ ID NO. 16) and EP015a (SEQ ID NO. 
32) contain extra bases at their 5 '-ends and 3 ' ends , 
respectively, which facilitate construction of a Hindlll 
recognition site. 
2 0 Assembly of blocks 1, 2, 3 and 4 

Blocks 1, 2, 3, and 4 as shown in Figure 1A were 
synthesized by ligating the above-noted EPO 
oligonucleotides according to the following protocol. 

40 pmoles of each oligonucleotide were mixed in a 
25 microcentrifuge tube in a final volume of SOjLll . The 
oligonucleotides were annealed by heating at 98°C for 5 
minutes in a heat block. The heat block containing the 
tube was removed from the heating unit and allowed to cool 
on the benchtop to 3 0°C (approximately 1.5 hrs) . 6(il of 
30 10X T 4 DNA ligase buffer (0.5M Tris-HCl, pH 7.8, 0 . 1M 
MgCl 2 , 0.1M DTT, lOmM ATP and 250|ig/ml BSA) , was added to 
2JJ.1 (or 2 units) of T 4 DNA ligase, and brought to volume to 
60^x1 with double distilled H 2 0 and incubated overnight at 
14°C. The mixture was heated at 75°C for 10 minutes to 
35 inactive ligase and then cooled on ice to dissociate non 
ligated oligonucleotides and ethanol precipitate. 

Blocks 1, 2, 3 and 4 were purified as follows. 
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Each ligation mix was run on a 3% low melting agarose gel. 
No band of the expected size was visible on the gel. 
However, DNA was extracted from the region of the gel 
where the gene block was expected to be. The rationale 
5 was that there was a small amount of the correct gene 
block present. 

The DNA extracted from the agarose gel was amplified 
by the polymerase chain reaction (with the NE Biolabs Vent 
DNA polymerase, 1 cycle 30 sec at 98°C, 25-30 cycles 30 

10 sec at 98°C, 40 sec at 50~55°C and 2 min at 72°C, 1 cycle 
10 min at 72°C, and cooled at 6°C. The primers were 
complementary to the ends of the gene blocks and contained 
a few extra bases so that the entire recognition sites for 
the specific restriction enzymes flanking each block would 

15 be present in each complete block. Blocks were then 
cloned into the Smal site of pUC18 or pUC19 plasmids . 

Assembled gene blocks were sequenced by the dideoxy- 
terminator method (Sanger, and Coulson, 1975, J. Mol . 
Biol. 94., 441-448). Typically, 5-12 clones had to be 

20 sequenced for each gene block in order to identify one 
with the expected sequence. 

EPO cDNA sequences were assembled as described in 
Example 2 herein. EPO short and EPO iong cDNA were assembled 
into pUC18 as shown in Figures 2A and 2B, to generate 

25 pLW2 0 and pLW21, respectively. 

EXAMPLE 2 
Assembly of Expression Vectors 

Synthesis and assembly of EPO cDNA 

Two EPO cDNA sequences were chemically synthesized: 

3 0 EPO short and EP0 long . The reason for making both a long and 
a short version of the gene was based on previous results 
while expressing another gene in COS cells. It had been 
noted that reducing the length of the 3' non-coding 
sequence coincided with increased expression of the gene. 

3 5 However, this was not observed with EPO. Therefore the 
constructs used for this example contained the EP0 long cDNA, 
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as shown in SEQ ID NO. 3 5 and Figure 4. 

Figure 1A, provides a schematic representation of 
the synthesis and assembly of the EP0 long cDNA sequence (SEQ 
ID NO. 35 and Figure 4) from the oligonucleotides. 
5 Briefly, the coding and complementary strands from 
contiguous oligonucleotides which had an average length of 
60 bases were synthesized. The breakpoints between the 
oligonucleotides were chosen such that when two 
complementary oligonucleotides annealed, cohesive ends 
10 compatible with those of the adjoining oligonucleotide 
pair were created. The EP0 long cDNA (SEQ ID NO. 3 5 and 
Figure 4) was constructed from fifteen oligonucleotide 
pairs, shown in Figure IB and SEQ ID NOS . 1-32) The 
coding strand oligonucleotides were numbered from 1 to 15 
15 except for 13 which was replaced with 13b, and the 
complementary strand oligonucleotides were numbered la to 
15a except for 13a which was replaced with 13ba* 

The fifteen oligonucleotide pairs were assembled in 
four blocks. Each junction between adjacent blocks 
20 constitutes a unique restriction site. Only three silent 
mutations were introduced into the EPO lGng cDNA sequence for 
the creation of unique restriction sites: (1) c at 
position 22 was replaced with A, to add a BstXI site, (2) 
C at position 256 was replaced with T, to remove a PstI 
25 site, and (3) C at position 705 was replaced with T, to 
add a Sspl site. The assembly of the four blocks was done 
in PUC18, leading to the vector pLW21. pLW21 is the pUC18 
vector with the 788 bp EP0 long cDNA sequence, as shown in 
SEQ ID NO. 3 5 and Figure 4, inserted in the Hindlll 
30 restriction site. 

Isolation of genomic EPO DNA 

A human leukocyte genomic library was purchased from 
Clontech and was screened by plaque hybridization with 
fragments of EPO cDNA as probes. A 2 . 4kb (2,3 65bp) EPO 
3 5 genomic clone was isolated. This clone spans the HUMERPA 
sequence (Genbank, accession number M11319) (SEQ ID NO. 33 
and Figure 3A) from nucleotide 499 to nucleotide 2365, 
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that is to say its 5' end maps 126 nucleotides upstream of 
the ATG initiation coding of the EPO pre-protein. The 
rationale for isolating the EPO gene was that the length 
of the DNA between the SAR elements would be longer and 
5 might allow for an increased effect of the SAR elements. 
SAR elements 

Two human apolipoprotein B (apoB) SAR elements were 
used which co-map with the boundaries of the human 
apolipoprotein B gene chromatin domain (Levy-Wilson & 

10 Fortier, 1989, J. Biol. Chem. 264:21196). The following 
clones were used: RhlO carrying the distal 1212 bp-long 
5 'human apoB SAR element and 1317 bp of proximal sequence 
(SEQ ID NO. 3 7 and Figure 6) and a clone (Rh32) carrying 
the 605 bp long 3'hu apoB SAR element (SEQ ID NO. 36 and 

15 Figure 5) . 

The DNA sequence of RhlO was determined by dideoxy- 
terminator method (Sanger, F. and Coulson, A.R., 1975, J. 
Mol. Biol. 94:441) and is shown in SEQ ID NO. 37 and 
Figure 6. The 2529 bp RhlO sequence consists of the 1212 

20 bp 5' distal human apoB SAR elements and the 1317 bp 5' 
proximal sequence in the 5' to 3 ' orientation. The DNA 
sequence of Rh3 2 was also determined, and was found to be 
identical to the sequence published in Levy-Wilson & 
Fortier (1989, J. Biol. Chem. 264:21196) (SEQ ID NO. 36 

25 and Figure 5) , except for a 60 base pair deletion spanning 
nucleotides 259 to 318, shown within brackets in Figure 5. 
Contrary to RhlO which is not a typical SAR sequence, Rh3 2 
is A/T rich and contains 22 copies of the ATATTT motif. 
Regulatory Elements of Expression Vectors 

3 0 Two basic expression vectors were derived from 

pAXlll and pAX142 (renamed pLW19 or pSB3 and pLW18 or 
pSB2, respectively) . pAXlll (Kay & Humphries, Methods 
Mol. Cell. Biol. 2:254, 1991) carries the human 
cytomegalovirus IE enhancer and promoter and the p-globin 

35 intron, while pAX142 carries the elongation factor-la 
promoter and intron. For each of these vectors there are 
single restriction sites located on each site of the EPO 
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transcription unit, where the SAR elements were introduced 
(Spel or EcoRV at the 5 ' end and Hpal at the 3 'end) . Maps 
of the two vectors pLW18 (pSB2) and pLW19 (psB3) are shown 
in Figures 7 and 8 respectively. 
5 Assembly of Expression Vectors 

The EP0 lon9 cDNA sequence (SEQ ID NO .35, Figure 4) or 
the genomic EPO DNA (SEQ ID NO. 33, Figure 3A) were 
introduced clockwise into the cloning sites of the 
expression vectors. In the case of the genomic clones, 

10 the P-globin and EF1 introns were removed from the pSB3 
and pSB2 vectors, respectively. A summary of the 
expression vectors is shown in Table 1. 
Construction of pLW24 and pLW2 5 Vectors 

The EP0 long cDNA (SEQ ID NO. 35, Figure 4) was removed 

15 from pLW21 by digestion with Hindlll, was blunt-ended with 
the T4 DNA polymerase and was introduced in the clockwise 
orientation into the Smal site of pLW18 and pLW19 to 
generate pLW24 and pLW25, respectively. 
Construction of P24MAR1 and P25MAR1 

20 Introduction of the 5' human apoB SAR element at the 

5' end of the EPO transcription unit was accomplished as 
follows. The 2,529 bp Xbal fragment from RhlO ( SEQ ID NO. 
37, Figure 6) was introduced in the clockwise orientation 
into the Spel site of pLW24 and pLW25 to generate p5MAR24 

25 and p5MAR25, respectively (Note that Xbal and Spel sites 
are compatible) . 

Introduction of the 60 bp-deleted 3 ' human apoB SAR 
element at the 3 ' end of the EPO transcription unit was 
accomplished as follows. The 605 bp Dral-Rsal fragment 

3 0 from the plasmid 12 DI Eco was added Xhol linkers and 
cloned into the Xhol site of the Stratagene pSBII SK( + ) 
plasmid, to generate plasmid 3 ' apoBX (Rh32) (SEQ ID NO. 
36, Figure 5) . The -60 5 bp SAR fragment was then removed 
from Rh32 by Xhol digestion, blunt-ended with the Klenow 

3 5 enzyme and cloned into the Hpal site of p5MAR24 and 
p5MAR25 to generate p24MAR-l and p25MAR-l, respectively. 
The orientation of the inserted fragments has not yet been 
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determined. 

Construction of PAP142 and PAP140 

Plasmids pSB2 and pSB3 were derived from pLW18 and 
pLWl9, respectively. A 12bp DNA linker with the EcoRV and 
5 SphI restriction sites was inserted into the Kpnl site of 
pLW18 and pLW19 to generate pSB2 and pSB3 , respectively. 
This allowed for introduction of SAR elements into the 
blunt-end EcoRV restriction site, just upstream of the EPO 
transcription unit. 
10 The EP0 long cDNA sequence was removed from pLW24 by 

digestion with Sail and cloned in the clockwise 
orientation into the Sail site of pSB2 to generate pAP13 . 
pAP5 was generated by removing the EP0 long cDNA sequence 
from pLW25 by digestion with BamHI and cloning in the 
15 clockwise orientation into the BamHI site of pSB3 to 
generate pAP5 . 

The Rh32 -605 bp Xhol SAR fragment {60 bp-deleted 3' 
human apoB SAR element) was blunt -ended with the T4 DNA 
polymerase and inserted in the clockwise orientation into 
20 the EcoRV site of pAPl3 and pAP5 to generate pAP138 and 
pAP13 6, respectively. The same Rh32 -605 bp Xhol blunt- 
ended SAR fragment was inserted in the clockwise 
orientation into the Hpal site of pAPl3 8 and pAPl3 6 to 
generate pAP142 and pAP140, respectively. 
25 Construction of DAP59 and oAP67 

pAP42 is an intronless version of pSB2 . pSB2 was 
amplified by the polymerase chain reaction between 
nucleotides 1308 and 321, and the amplified fragment was 
ligated on itself to produce pAP42 . pAP43 is an intronless 
30 version of pSB3 . The Smal - XmnI fragment (nucleotides 
677 - 1320) was removed from pSB3 to produce pAP43 . 

The EPO genomic clone was isolated from a Clontech 
library (Cat. No. HLl006d; Lot No. 19412) by probing with 
the BamHI EP0 8hort cDNA fragment from pAP4 . The EPO short cDNA 
3 5 fragment used as a probe to isolate the genomic EPO clone 
was a BamHI fragment from pAP4 . As for EP0 long cDNA, EPO shorc 
cDNA was first assembled into the Hindlll site of the 
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pUC18 vector, to generate the pLW2 0 vector, blunt-ended 
with the T4 DNA polymerase and inserted in the clockwise 
orientation into the Smal site of pLW19 to generate pLW23. 
Then the EPO short cDNA sequence was removed from pLW23 by 
5 BamHI digestion and cloned in the clockwise orientation 
into the BamHI site of pSB3 to generate pAP4 . 

Plaque hybridization was performed according to the 
method described in "Molecular Cloning. A Laboratory 
Manual, 2nd Edition. 1989". Edited by Sambrook, Fritsch, 

10 and Maniatis, Cold Spring Harbor Laboratory Press. pp 
2.108 - 2.121. A 7.5 kb BamHI fragment (cloned in the 
Stratagene pBS (KS+) plasmid) was identified as an EPO 
genomic clone. The identity of this clone was confirmed 
by polymerase chain reaction between primers EPOGEN 1 

15 (nucleotides 499- 518 in HUMERPA sequence) and EPOGEN 12 
(nucleotides 2848 - 2863, on complementary strand, in 
HUMERPA sequence) . This resulted in the expected 

fragment. The 2,365 bp amplified fragment was cloned in 
the clockwise orientation into the Hindi site of pUC19 to 

20 generate pAP41. The EPO genomic sequence was then removed 
from pAP41 by EcoRI and Hindlll digestion, blunt-ended 
with the T4 DNA polymerase and inserted in the clockwise 
orientation into the Smal site of pAP42 to generate pAP59 
or into the blunt-ended BamHI site of pAP43 to generate 

2 5 pAP67. 

Construction of PAP123, pAP127, pAP132 and PAP134 

The Rh32 -605 bp Xhol SAR fragment (60 bp-deleted 3' 
human apoB SAR element) was blunt -ended with the T4 DNA 
polymerase and inserted in the clockwise orientation into 

30 the Hpal site of pAP59 and pAP67 to generate pAP117 and 
pAP119, respectively. The same Rh32 -605 bp Xhol blunt- 
ended SAR fragment was inserted in the clockwise 
orientation into the EcoRV site of pAP117 and pAP119 to 
generate pAP123 and pAP127, respectively. 

35 The 2,529 bp Xbal fragment from RhlO was blunt-ended 

with the T4 DNA polymerase, and introduced in the 
clockwise orientation into the EcoRV site of pAP117 and 
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pAP119 to generate pAP132 and pAPl34, respectively. 

EXAMPLE 3 
Transfection and Expression of EPO 

Transf ection 

5 Briefly as described in more detail below CH0-K1 

cells were co-transf ected by the calcium phosphate 
precipitate method by two pairs of vectors: pSV2-neo 
carrying the resistance to the selective agent G-418 and 
one of the two following vectors expressing the EPO cDNA 

10 from the EF1 promoter: pLW24 or p24-MAR. pLW24 had no SAR 
element, while p24-MAR had the 5 ' apo B SAR element (RhlO) 
upstream of the EF 1 promoter region and the 60 bp-deleted 
3'apoB SAR element (Rh32) downstream of the EPO cDNA. 

G-418 was added to the medium to select cells 

15 transf ected with pSV2-neo. At least 80% of these 
transf ectants were expected to have been co-transf ected 
with the EPO vector. 

The following protocol was used for transfection 
with the Mammalian Transfection Kit by Stratagene (Catalog 

20 #200285) . On day -1 100mm culture dishes were inoculated 
with exponentially growing mammalian cells at a 
concentration of 5 x 10 4 cells per ml in 10 ml. The cells 
were grown overnight at 37°C with the appropriate level of 
C0 2 and in appropriate medium. For CH0-K1 cells Ham's F12 

25 complete medium with 10% fetal calf serum (FCS) was used 
and incubation was carried out with 5% C0 2 . The cells were 
approximately 10-2 0% confluent on day zero. 

The optimal amount of DNA to be used for 
transfection varied depending on the cell type being used 

30 for transfection. Usually, 10-30 \ig of plasmid DNA was 
used. The plasmid used for selection was generally added 
at some ratio to the expression vector. For pSV2neo, a 
ratio of 1:10 to 1:15 was found to be appropriate. 

The desired amount of DNA was diluted to 450 |il with 

35 double distilled H 2 0. 50 ^1 of solution 1 (2.5M Ca Cl 2 , 
included in the Stratagene kit) was added slowly and 
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dropwise. 500 jil of Solution 2 (2 x BBS, pH 6.95 
[consists of 50 mM n, n-bis (2-hydroxyethyl) -2- 
aminoethanesulfonic acid and buffered saline, 280 mM NaCl 
and 1.5 mM Na 2 HPOJ ) was added and mixed gently and 
5 slowly. It was found to be important to perform these two 
additions slowly and gently. The mixture was allowed to 
incubate at room temperature for 10-20 minutes. The 
precipitate was gently mixed to ensure adequate 
suspension. The precipitate was added to the culture 
10 dropwise while gently swirling the plate to distribute the 
suspension evenly, followed by incubation for 12-24 hours. 
Where spare incubator space was available, it was 
determined that transfection efficiency could be improved 
2-3 times by using lower C0 2 concentrations at this point 
15 (2-4% is recommended; 3% seemed to work well with CHO 
cells) . Normal C0 2 concentrations were resumed after 
removal of precipitate on Day 1. 

On day 1 the medium was removed by aspiration and 
the culture rinsed twice with sterile PBS (Phosphate 
20 buffered saline without Ca 2 \ Mg 2 *) or medium without serum. 
Fresh complete medium was applied (as on Day - 1) and the 
cells incubated under optimal C0 2 concentration for 24 
hours . 

The cells were split at a ratio ranging from 1:30 to 
25 1:100 into 96 well plates (0.2 ml per well). For CH0-K1 
cells, Ham's F12 complete medium with 20% FCS was used at 
this step. Incubation was carried out at 37°C with 5% C0 2 . 

On day 4 the medium was aspirated and selective 
medium was added. For transf ections using CHO-K1 cells 
3 0 and pSV2neo as the selective plasmid, Ham's F12 complete 
medium with 20% FCS + 270-400 fig/ml G418 was used. 

On day 5 or 6 (preferably day 5) and on day 8, the 
cultures were refed with selective medium. 

On day 10, spent medium was collected for RIA 
3 5 assays and frozen. The spent medium was replaced with 
selective medium. 

From day 11 RIA assays, were performed to determine 
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levels of EPO production in the medium, using undiluted 
samples. Samples producing high levels of EPO were cloned 
and subcultured in Ham's F12 complete medium with 10% FCS 
when they reached confluency. 
5 EPO Production 

Levels of EPO production in the medium were 
determined by RIA as follows. 

The following materials and equipment were used in 
the assay: phosphate RIA buffer: 0.05M NaP0 4 ; 0.2% BSA; 
10 0.02% NaN 3t pH 7.4; Erythropoietin {standards, Boehringer 
Mannheim, Cat.# 1120166, 250 U/ml) ; 125 I-EPO ( Amer sham Cat . # 
IM.219); anti-rabbit IgG (whole molecule) developed in 
goat, whole antiserum (Sigma Cat . # R-5001 (abbrev. Ab2 ) 
polyethylene glycol 8000 (BDH Cat . #B80016 ) ; normal rabbit 
15 serum (abbrev. NRS) ; Rabbit anti-EPO, polyclonal, lmg 
(abbrev. Abl) (R&D Systems Cat . # AB-286-NA) and; gamma 
counter (LKB-Wallac RIAGamma 1272). 

The stock concentration of EPO was 250 U/ml. Eight 
standard concentrations were prepared from a stock 
20 dilution of 20 U/ml sufficient for 4.5 ml of each standard 
or 20 assays in duplicate. 20 U/ml = 88 ^ll @ 250 U/ml + 
1012 \ll dilution buffer. The eight standard 

concentrations are shown in Table 2 . The standards were 
divided into 450 (ll aliquots in microcentrifuge tubes and 
25 stored at -20 'C until used. 

Antibody 1 (1 mg) was dissolved in 1 ml RIA/DB and 
stored in 20 [il aliquots in 0.5 ml microcentrifuge tubes 
at -20 # C until used (each aliquot was sufficient for 100 
assay tubes) . On the assay date, Abl was prepared at a 
30 working concentration of 2|ig/ml with dilution buffer (a 
1/500 dilution of the stock aliquot) to give a final assay 
concentration of 0.67 ^ig/ml in 300 |il when added as 100 ^1 
to all tubes except non-specific binding (NSB) and total 
counts (TO . 

35 Since all standard and sample binding inhibition % 

were based on the maximum binding (MB) , to avoid the 
possibility of having a poor reference duplicate, the MB 
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was determined in quadruplicate. 

The 125 I-EPO was diluted with 4.5 ml of dilution 
buffer and divided into 25 x 0.2 ml aliquots in 500 |il 
microcentrifuge tubes. A 5 Jll sample was added to 50 |il of 
5 dilution buffer in an 11x75 mm tube and counted for 1 min. 
in the RIAGamma using program 2 .The corrected CPM usually 
ranged between 4500 - 6000 CPM/^tl. The tracer aliquots 
were stored, with lead shielding, at -20 # C until used. 

Table 3 lists the assay setup and addition of 

10 reagents. The reagents were added and incubated in the 
following order: A) dilution buffer and NRS @ 1/33.3 were 
added to the appropriate tubes 3-8; B) EPO standards and 
sample dilutions were added to the appropriate tubes; C) 
antibody 1 was added to all tubes except 1-4 and; all 

15 tubes were vortexed and incubated at room temperature for 
4h. 125 I-EPO 0 6000 CPM/100 \il was added to all tubes. As 
the tracer activity decreases over time, the maximum 
binding % decreases resulting in a loss of sensitivity and 
accuracy. Therefore, we used 6000 CPM total counts during 

20 the first week of a new batch and increased by 1000 CPM 
per week thereafter. 

All tubes (except TC's) were vortexed and incubated 
overnight (18-20 hours) 9 4*C. Ab 2 and NRS (1/62.5) was 
added to all tubes except TC's. All tubes (except TC's) 

2 5 were vortexed and incubated at room temperature for 2 

hours. 1.5ml of 3.8% PEG 8000 (w/v in dilution buffer) was 
added to all tubes (except TC's) resulting in a final PEG 
concentration of 3%. All tubes (except TC's) were vortexed 
and incubated for 10 minutes at room temperature. All 
30 tubes were centrifuged (except TC's) at 1500 x g and 4"C 
for 20 minutes (1500 x g = 2800 RPM when using the IEC 
Centra-8R with the Cat.# 5737 12x75 mm tube adapters). 
The supernants were removed by aspiration and the tubes 
containing the pellets were counted in the gamma counter. 

3 5 The standard curve was plotted using linear 

regression of logit (B/Bo) vs log concentration where B = 
CPM bound and, Bo = maximum CPM bound (reference binding 
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of Ab 1 without inhibitor) . The regression line y = CO + 
Cl(x) where, CO = y intercept, CI = slope, x = Log 
(Concentration), y = Log (R/(l-R)) and, R = B/Bo. 

Interference of culture medium was tested by the 
5 addition of up to 95 |il of a possible 100 ^1 sample volume 
of Ham's F-12 containing 10% FCS . No effect on binding was 
observed. Recovery of EPO at 100, 300, and 600 mU/ml was 
93.85, 100.41, and 96.33% respectively. 

Screening was performed as follows. The assay was 

10 performed in one day for screening samples by using the 
following modifications to the above-noted method. 
Standards and sample dilutions were incubated with Abl for 
1 hour at 37 *C instead of 4 hours at room temperature. 125 I- 
EPO was incubated with all tubes for 1 hour at 37 *C 

15 instead of overnight at 4'C. Antibody 2 and NRS are 
incubated for 1 hour at 37 "C instead of 2 hours at room 
temperature. The remaining procedures were as described 
above and the results are given below. 

The first transfection series was performed with the 

20 following target vectors: PLW24 (EF1 promoter, EP0 long ) 
referred to as EPO-1 or; p24MAR-l (EF1 promoter, EP0 long , 
5'apoB & 3' 60 bp-deleted apoB SAR elements) referred to 
as EPO-1*. The selection plasmid was pSV2neo. The 
target : selection plasmid ratio was 5:1 (30 |ig:6p,g). 

2 5 One sample was picked up randomly from each 

transfection and was cloned by the dilution method. 
EPO-1-0-1 was cloned from the EPO-1 transfection and 
EPO-1-0-4* from the EPO-1* transfection (* indicates the 
presence of SAR elements in the target vector) . 

3 0 After the first round of cloning, the two EPO-1 

clones and four EPO-1" clones expressing the highest levels 
of EPO were selected and expanded. Levels of EPO 
production in the medium, determined by the RIA assays 
were as follows: EPO-1-1-5: 8 u/10 6 cells/day; EPO-1-1-6: 
35 9 u/10 6 cells/day; EPO-1-1-7*: 41 u/10 6 cells/day; EPO-1-1- 
8*: 30 u/10 6 cells/day; EPO-1-1-13*: 125 u/10 6 cells/day 
and; EPO-1-1-14*: 81 u/10 6 cells/day. 
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After a second round of cloning, levels of EPO 
expression were as follows: EPO-1-2-22: 5 u/10 6 cells/day; 
EPO-1-2-23: 7 u/10 6 cells/day; EPO-1-2-26: 13 u/10 6 
cells/day; EPO-1-2-27: 19 u/10 6 cells/day; EPO-1-2-15*: 13 
5 u/10 6 cells/day; EPO-1-2-16*: 20 u/10 6 cells/day and; EPO-1- 
2-17": 105 u/10 6 cells/day. 

The second transfection series was performed with 
the same target and selection vectors as the first 
transfection series. The target plasmid: selection plasmid 
10 ratio was 12:1 ( 3 Ojig : 2 . 5^lg) . 

At day 10 after transfection, EPO-2-0-11 (producing 
0.8 u/ml EPO) from EPO-2 transfection (target plasmid has 
no SAR elements) and EPO 2-0-9* (producing 0.6 u/ml) from 
EPO-2* transfection (target plasmid with SAR elements) were 
15 selected for cloning. 21 days after cloning, the levels of 
EPO production in the medium for EPO-2 -0-11 were as 
follows: 36/48 samples had <160 u/ml; 11/48 samples had 
160-800 u/ml; and 1/48 samples had 800 u/ml. Levels of 
EPO production for EPO-2-0-9* were as follows: 47/48 
20 samples had 1,000-4,000 u/ml and; 1/48 samples had <800 
u/ml . 

Two of the EPO-2 clones and three of the EPO-2* 
clones (expressing the highest levels of EPO per cell 
cluster) were expanded, and the levels of expression were 

25 as follows: EPO-2-1-24 (514 u/ml 21 days after cloning) 
and 109 u/10 6 cells/day after expansion; EPO-2-1-25 (800 
u/ml 21 days after cloning) and 182 u/10 6 cells/day after 
expansion; EPO-2-1-18* (1,962 u/ml 21 days after cloning) 
and 185 u/10 6 cells/day after expansion; EPO-2-1-19* (1,682 

30 u/ml 21 days after cloning) and 1,306 u/10 6 cells/day after 
expansion and; EPO-2-1-21* (1,777 u/ml 21 days after 
cloning) and 332 u/10 6 cells/day after expansion. 

EPO-2-1-19* was found to maintain its levels of EPO 
production while submitted to successive freezing-revival 

35 cycles as follows: 962 and 1,038 u/10 6 cells/day after the 
first cycle; 1,323 and 1,264 u/10 6 cells/day (between days 
3 and 4) after the second cycle and; 1,188 u/10 6 cells/day 
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(between days 3 and 4) after the third cycle. 

EPO-2-1-19* was submitted to a second round of 
cloning. The clones producing the highest levels of EPO 
produced between 1,500 and 1,700 u/10 6 cells/day (between 
5 days 3 and 4) . The EPO-2-1-19* cell line has remained 
stable and expressed constant levels of EPO for about 
eight months. 
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TABLE 1 

Assembly of expression vectors 



| EF1 

| promoter 


CMV 
promoter 


EPO 

sequence 


5' SAR 
element 


3' SAR 
element j 


S PLW24 Rh49 


pLW25 Rh2ie 


cDNA looo 






I p24MAR-1 

1 RH163 


P25MAR-1 

Rh169 


cDNA l0Ofl 


Rh10 


Rh32 


1 PAP142 Rh223 


PAP140 Rh233 


cDNA_ 


Rh32 


Rh32 


I PAP59 Rh183 


PAP67 Rh209 


genomic 






I PAP123 RH211 


PAP127 Rh208 


genomic 


Rh32 


Rh32 [ 


1 PAP132 Rh221 


PAP134 Rh222 


genomic 


Rh10 


Rh32 I 
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TABLE 2 



Standard No. 


20 U/ml Stock 
(ul) 


Dilution 
BufTer 
(ul) 


Standard 
Concentration 
(mil/ml) 




1 


4.5 


4495.5 


20 


2 


9.0 


4491.0 


40 


3 


18.0 


4482.0 


80 


4 


36.0 


4464.0 


160 


5 


67.5 


4432.5 


300 


6 


135.0 


4365.0 


600 


7 


270.0 


4230.0 


1,200 


8 


450.0 


4050.0 


2,000 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANTS: 

(A) NAME: Cangene Corporation 

(B) STREET : 104 Chancellor Matheson Road 

(C) CITY: Winnipeg 
<D) STATE: Manitoba 

( E ) COUNTRY : Canada 

(F) POSTAL CODE: R3T 2N2 

(G) TELEPHONE NO . : (204) 989-6850 

(H) TELEFAX NO.: (204) 269-7003 



(A) NAME: Delcuve, Genevieve 

(B) STREET: 74 McGill Place 

(C) CITY: Winnipeg 

(D) STATE: Manitoba 

( E ) COUNTRY : Canada 

(F) POSTAL CODE: R3T 5B2 

(ii) TITLE OF INVENTION: Recombinant DNA Molecules and Expression 
Vectors for Erythropoietin 



(iii) NUMBER OF SEQUENCES: 37 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: BERESKIN & PARR 

(B) STREET: 40 King Street West 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) ZIP: M5H 3Y2 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0 , Version #1.25 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER : 

(B) FILING DATE: 

( C ) CLASSIFICATION : 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: McDiarmid, Shona S. 

<B) REGISTRATION NUMBER: 3 8,798 

(C) REFERENCE /DOCKET NUMBER: 7841-013 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (416) 364-7311 

(B) TELEFAX: (416) 361-1398 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 
<B) CLONE: ' EPOl 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
AGCTTGCCCG GGATGAGGGC CACCGGTGTG GTCACCCGGC GCGCCCCAGG T 51 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP02 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CGCTGAGGGA CCCCGGCCAG GCGCGGAGAT GGGGGTGCAC GAAT 44 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP03 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GTCCTGCCTG GCTGTGGCTT CTCCTGTCCC TGCTGTCGCT CCCTCTGGGC C 51 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 
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(B) CLONE: EP04 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
TCCCAGTCCT GGGCGCCCCA CCACGCCTCA TCTGTGACAG CCGAGTCCTG GAGAGGTAC 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TO POLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP05 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CTCTTGGAGG CCAAGGAGGC CGAGAATATC ACGACGGGCT GTGCTGAACA TTGCAG 
{ 2 ) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP06 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CTTGAATGAG AATATCACTG TCCCAGACAC CAAAGTTAAT TTCTATGCCT GGAAGAGG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP07 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ATGGAGGTCG GGCAGCAGGC CGTAGAAGTC TGGCAGGGCC TGGCCCTGCT GTCGG 55 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
AAGCTGTCCT GCGGGGCCAG GCCCTGTTGG TCAACTCTTC CCAGCCGTGG GAGCCCCTGC 6 0 

61 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP09 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GCTGCATGTG GATAAAGCCG TCAGTGGCCT TCGCAGCCTC ACCACTCTGC TTCG 54 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EPO10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



BNSDOCID: <WO 9619573A1_L> 



WO 96/19573 



v 



PCT/CA95/00696 



- 48 - 



GGCTCTGGGA GCCCAGAAGG AAGCCATCTC CCCTCCAGAT GCGGCCTCAG C 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EPOll 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TGCTCCACTC CGAACAATCA CTGCTGACAC TTTCCGCAAA CTCTTCCGAG T 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP012 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CTACTCCAAT TTCCTCCGGG GAAAGCTGAA GC TGTAC AC A GGGGAGG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP013 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCTGCAGGAC AGGGGACAGA TGACCAGGTG TGTCGACCTG GGCATATC 
(2) INFORMATION FOR SEQ ID NO: 14: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EPO!3b 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CCTGCAGGAC AGGGGACAGA TGACCAGGTG TGTCCACCTG GGCATATC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP015 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CACCACCTCC CTCACCAATA TTGCTTGTGC CACACCCTCC CCCGCCACTC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP015 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CTGAACCCCG TCGAGGGGCT CTCAGCTCAG CGCCAGCCTG TCCCATGGAC CA 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 5 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EPOl alpha 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CCTCAGCGAC CTGGGGCGCG CCGGGTGACC AC AC CGGTGG CCCTCATCCC GGGCA 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP02 alpha 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GGCAGGACAT TCGTGCACCC CCATCTCCGC GCCTGGCCGG GGTC 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP03 alpha 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GGACTGGGAG GCCCAGAGGG AGCGACAGCA GGGACAGGAG AAGCCACAGC CA 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP04 alpha 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CTCTCCAGGA CTCGGCTGTC ACAGATGAGG CGTGGTGGGG CGCCCA 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EPOS alpha 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TTCAGCACAG CCCGTCGTGA TATTCTCGGC CTCCTTGGCC TCCAAGAGGT AC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP06A 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
AGGCATAGAA ATTAACTTTG GTGTCTGGGA CAGTGATATT CTCATTCAAG CTGCAATG 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP07 alpha 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
AGGGCCAGGC CCTGCCAGAC TTCTACGGCC TGCTGCCCGA CCTCCATCCT CTTCC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP08 alpha 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GGGGCTCCCA CGGCTGGGAA GAGTTGACCA ACAGGGCCTG GCCCCGCAGG ACAGCTTCCG 60 

ACAGC , 65 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP09 alpha 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AGTGGTGAGG CTGCGAAGGC CACTGACGGC TTTATCCACA TGCAGCTGCA 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: EPO10 alpha 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CGCATCTGGA GGGGAGATGG CTTCCTTCTG GGCTCCCAGA GCCCGAAGCA G 
(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EPOll alpha 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AGAC TCGGAA GAGTTTGCGG AAAGTGTCAG CAGTGATTGT TCGGAGTGGA GCAGCTGAGG 60 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EPO 12 alpha 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CCTGCAGGCC TCCCCTGTGT ACAGCTTCAG CTTTCCCCGG AGGAAATTGG AGT 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Homo sapiens 

(vii) IMMEDIATE SOURCE: 
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(B) CLONE: EP013 alpha 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GGTGGTGGAT ATGCCCAGGT CGACACACCT GGTCATCTGT CCCCTGT 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EPO!3b alpha 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GGTGGTGGAT ATGCCCAGGT GGACACACCT GGTCATCTGT CCCCTGT 
{2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: EP014 alpha 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GGGTTCAGGA GTGGCGGGGG AGGGTGTGGC ACAAGCAATA TTGGTGAGGG A 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: EPOIB alpha 



- 55 - 



PCT/CA95/00696 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 32: 
AGCTTGGTCC ATGGGACAGG CTGGCGCTGA GCTGAGAGCC CCTCGACG 48 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: j oin ( 62 5 . . 637 , 1201.. 1346, 1605.. 1691, 
2303 . .2482, 

2617 . .2772) 

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: join ( 62 5 . . 637 , 1201.. 1346, 1605.. 1691, 
2303. .2482, 

2617. .2772) 

(ix) FEATURE: 

(A) NAME /KEY : mat_pept ide 

(B) LOCATION: j oin ( 1 2 69 . . 13 46 , 1605.. 1691, 2303.. 2482, 2617 

. .2769) 

(ix) FEATURE: 

(A) NAME / KEY : prim_transcript 

(B) LOCATION: j oin ( 62 5 . . 3 3 37 ) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

AAGCTTCTGG GCTTCCAGAC CCAGCTACTT TGCGGAACTC AGCAACCCAG GCATCTCTGA 6 0 

GTCTCCGCCC AAGACCGGGA TGCCCCCCAG GGGAGGTGTC CGGGAGCCCA GCCTTTCCCA 12 0 

GATAGCACGC TCCGCCAGTC CCAAGGGTGC GCAACCGGCT GCACTCCCCT CCCGCGACCC 180 

AGGGCCCGGG AGCAGCCCCC ATGACCCACA CGCACGTCTG CAGCAGCCCC GCTCACGCCC 2 40 

CGGCGAGCCT CAACCCAGGC GTCCTGCCCC TGCTCTGACC CCGGGTGGCC CCTACCCCTG 3 00 

GCGACCCCTC ACGCACACAG CCTCTCCCCC ACCCCCACCC GCGCACGCAC ACATGCAGAT 3 60 

AACAGCCCCG ACCCCCGGCC AGAGCCGCAG AGTCCCTGGG CCACCCCGGC CGCTCGCTGC 420 

GCTGCGCCGC ACCGCGCTGT CCTCCCGGAG CCGGACCGGG GCCACCGCGC CCGCTCTGCT 4 80 

CCGACACCGC GCCCCCTGGA CAGCCGCCCT CTCCTCTAGG CCCGTGGGGC TGGCCCTGCA 540 

CCGCCGAGCT TCCCGGGATG AGGGCCCCCG GTGTGGTCAC CCGGCGCGCC CCAGGTCGCT 600 

GAGGGACCCC GGCCAGGCGC GGAG ATG GGG GTG CAC G GTGAGTACTC 647 

Met Gly Val His 
-27 -25 

GCGGGCTGGG CGCTCCCGCC GCCCGGGTCC CTGTTTGAGC GGGGATTTAG CGCCCCGGCT 707 
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ATTGGCCAGG AGGTGGCTGG GTTCAAGGAC CGGCGACTTG TCAAGGACCC CGGAAGGGGG 7 67 

AGGGGGGTGG GGCAGCCTCC ACGTGCCAGC GGGGACTTGG GGGAGTCCTT GGGGATGGCA 827 

AAAACCTGAC CTGTGAAGGG GACACAGTTT GGGGGTTGAG GGGAAGAAGG TTTGGGGGTT 887 

CTGCTGTGCC AGTGGAGAGG AAGCTGATAA GCTGATAACC TGGGCGCTGG AGCCACCACT 947 

TATCTGCCAG AGGGGAAGCC TCTGTCACAC CAGGATTGAA GTTTGGCCGG AGAAGTGGAT 1007 

GCTGGTAGCT GGGGGTGGGG TGTGCACACG GCAGCAGGAT TGAATGAAGG CCAGGGAGGC 1067 

AGCACCTGAG TGCTTGCATG GTTGGGGACA GGAAGGACGA GCTGGGGCAG AGACGTGGGG 1127 

ATGAAGGAAG CTGTCCTTCC ACAGCCACCC TTCTCCCTCC CCGCCTGACT CTCAGCCTGG 1187 

CTATCTGTTC TAG AA TGT CCT GCC TGG CTG TGG CTT CTC CTG TCC CTG 12 35 
Glu Cys Pro Ala Trp Leu Trp Leu Leu Leu Ser Leu 
-22 -20 -15 

CTG TCG CTC CCT CTG GGC CTC CCA GTC CTG GGC GCC CCA CCA CGC CTC 12 83 
Leu Ser Leu Pro Leu Gly Leu Pro Val Leu Gly Ala Pro Pro Arg Leu 
-10 -5 1 5 

ATC TGT GAC AGC CGA GTC CTG GAG AGG TAC CTC TTG GAG GCC AAG GAG 13 31 
lie Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu Leu Glu Ala Lys Glu 
10 15 20 

GCC GAG AAT ATC ACG GTGAGACCCC TTCCCCAGCA CATTCCACAG AACTCACGCT 13 86 
Ala Glu Asn lie Thr 
25 

CAGGGCTTCA GGGAACTCCT CCCAGATCCA GGAACCTGGC ACTTGGTTTG GGGTGGAGTT 1446 

GGGAAGCTAG ACACTGCCCC CCTACATAAG AATAAGTCTG GTGGCCCCAA ACCATACCTG 1506 

GAAAC T AGGC AAGGAGCAAA GCCAGCAGAT CCTACGGCCT GTGGGCCAGG GCCAGAGCCT 1566 

TCAGGGACCC TTGACTCCCC GGGCTGTGTG CATTTCAG ACG GGC TGT GCT GAA 1619 

Thr Gly Cys Ala Glu 
30 

CAC TGC AGC TTG AAT GAG AAT ATC ACT GTC CCA GAC ACC AAA GTT AAT 1667 
His Cys Ser Leu Asn Glu Asn lie Thr Val Pro Asp Thr Lys Val Asn 
35 40 45 

TTC TAT GCC TGG AAG AGG ATG GAG GTGAGTTCCT TTTTTTTTTT TTTTCCTTTC 1721 
Phe Tyr Ala Trp Lys Arg Met Glu 
50 55 

TTTTGGAGAA TCTCATTTGC GAGCCTGATT TTGGATGAAA GGGAGAATGA TCGGGGGAAA 17 81 

GGTAAAATGG AGCAGCAGAG ATGAGGCTGC CTGGGCGCAG AGGCTCACGT CTATAATCCC 1841 

AGGCTGAGAT GGCCGAGATG GGAGAATTGC TTGAGCCCTG GAGTTTCAGA CCAACCTAGG 1901 

CAGCATAGTG AGATCCCCCA TCTCTACAAA CATTTAAAAA AATTAGTCAG GTGAAGTGGT 19 61 

GCATGGTGGT AGTCCCAGAT ATTTGGAAGG CTGAGGCGGG AGGATCGCTT GAGCCCAGGA 2 021 

ATTTGAGGCT GCAGTGAGCT GTGATCACAC CACTGCACTC CAGCCTCAGT GACAGAGTGA 2081 

GGCCCTGTCT CAAAAAAGAA AAGAAAAAAG AAAAATAATG AGGGCTGTAT GGAATACATT 2141 

CATTATTCAT TCACTCACTC ACTCACTCAT TCATTCATTC ATTCATTCAA CAAGTCTTAT 22 01 



8NSDOCID: <WO 9619573A1_I_> 



- 57 - 



PCT/CA95/00696 



TGCATACCTT CTGTTTGCTC AGCTTGGTGC TTGGGGCTGC TGAGGGGCAG GAGGGAGAGG 2261 

GTGACATGGG TCAGCTGACT CCCAGAGTCC ACTCCCTGTA G GTC GGG CAG CAG 2314 

Val Gly Gin Gin 

GCC GTA GAA GTC TGG CAG GGC CTG GCC CTG CTG TCG GAA GCT GTC CTG 23 62 
Ala Val Glu Val Trp Gin Gly Leu Ala Leu Leu Ser Glu Ala Val Leu 
60 65 70 75 

CGG GGC CAG GCC CTG TTG GTC AAC TCT TCC CAG CCG TGG GAG CCC CTG 2410 
Arg Gly Gin Ala Leu Leu Val Asn Ser Ser Gin Pro Trp Glu Pro Leu 
80 85 90 

CAG CTG CAT GTG GAT AAA GCC GTC AGT GGC CTT CGC AGC CTC ACC ACT 2458 
Gin Leu His Val Asp Lys Ala Val Ser Gly Leu Arg Ser Leu Thr Thr 
95 100 105 

CTG CTT CGG GCT CTG GGA GCC CAG GTGAGTAGGA GCGGACACTT CTGCTTGCCC 2512 
Leu Leu Arg Ala Leu Gly Ala Gin 
110 115 

TTTCTGTAAG AAGGGGAGAA GGGTCTTGCT AAGGAGTACA GGAACTGTCC GTATTCCTTC 2572 

CCTTTCTGTG GCACTGCAGC GACCTCCTGT TTTCTCCTTG GCAG AAG GAA GCC ATC 2 62 8 

Lys Glu Ala lie 

TCC CCT CCA GAT GCG GCC TCA GCT GCT CCA CTC CGA AC A ATC ACT GCT 2676 
Ser Pro Pro Asp Ala Ala Ser Ala Ala Pro Leu Arg Thr lie Thr Ala 
120 125 130 135 

GAC ACT TTC CGC AAA CTC TTC CGA GTC TAC TCC AAT TTC CTC CGG GGA 2724 
Asp Thr Phe Arg Lys Leu Phe Arg Val Tyr Ser Asn Phe Leu Arg Gly 
140 145 150 

AAG CTG AAG CTG TAC ACA GGG GAG GCC TGC AGG ACA GGG GAC AGA TGACCAGGTG 
2779 

Lys Leu Lys Leu Tyr Thr Gly Glu Ala Cys Arg Thr Gly Asp Arg 
155 160 165 



TGTCCACCTG 


GGCATATCCA 


CCACCTCCCT 


CACCAACATT 


GCTTGTGCCA 


CACCCTCCCC 


2839 


CGCCACTCCT 


GAACCCCGTC 


GAGGGGCTCT 


CAGCTCAGCG 


CCAGCCTGTC 


CCATGGACAC 


2899 


TCCAGTGCCA 


GCAATGACAT 


CTCAGGGGCC 


AGAGGAACTG 


TCCAGAGAGC 


AACTCTGAGA 


2959 


TCTAAGGATG 


TCACAGGGCC 


AACTTGAGGG 


CCCAGAGCAG 


GAAGCATTCA 


G AG AGC AGC T 


3019 


TTAAACTCAG 


GGACAGAGCC 


ATGCTGGGAA 


GACGCCTGAG 


CTCACTCGGC 


ACCCTGCAAA 


3079 


ATTTGATGCC 


AGGACACGCT 


TTGGAGGCGA 


TTTACCTGTT 


TTCGCACCTA 


CCATCAGGGA 


3139 


CAGGATGACC 


TGGAGAACTT 


AGGTGGCAAG 


CTGTGACTTC 


TCCAGGTCTC 


ACGGGCATGG 


3199 


GCACTCCCTT 


GGTGGCAAGA 


GCCCCCTTGA 


CACCGGGGTG 


GTGGGAACCA 


TGAAGACAGG 


3259 


ATGGGGGCTG 


GCCTCTGGCT 


CTCATGGGGT 


CCAAGTTTTG 


TGTATTCTTC 


AACCTCATTG 


3319 


ACAAGAACTG 


AAACCACCAA 


TATGACTCTT 


GGCTTTTCTG 


TTTTCTGGGA 


ACCTCCAAAT 


3379 


CCCCTGGCTC 


TGTCCCACTC 


CTGGCAGCAG 


TGCAGCAGGT 


CCAGGTCCGG 


GAAATGAGGG 


3439 


GTGGAGGGGG 


CTGGGCCCTA 


CGTGCTGTCT 


CACACAGCCT 


GTCTGACCTC 


TCGACCTACC 


3499 
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GGCCTAGGCC ACAAGCTCTG CCTACGCTGG TCAATAAGGT GTCTCCATTC AAGGCCTCAC 3559 
CGCAGTAAGG CAGCTGCCAA CCCTGCCCAG GGCAAGGCTG CAG 3 602 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Gly Val His Glu Cys Pro Ala Trp Leu Trp Leu Leu Leu Ser Leu 
-27 -25 -20 -15 

Leu Ser Leu Pro Leu Gly Leu Pro Val Leu Gly Ala Pro Pro Arg Leu 
-10 . -5 15 

lie Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu Leu Glu Ala Lys Glu 
10 15 20 

Ala Glu Asn lie Thr Thr Gly Cys Ala Glu His Cys Ser Leu Asn Glu 
25 30 35 

Asn lie Thr Val Pro Asp Thr Lys Val Asn Phe Tyr Ala Trp Lys Arg 
40 45 50 

Met Glu Val Gly Gin Gin Ala Val Glu Val Trp Gin Gly Leu Ala Leu 
55 60 65 

Leu Ser Glu Ala Val Leu Arg Gly Gin Ala Leu Leu Val Asn Ser Ser 
70 75 80 85 

Gin Pro Trp Glu Pro Leu Gin Leu His Val Asp Lys Ala Val Ser Gly 
90 95 100 

Leu Arg Ser Leu Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Lys Glu 
105 HO H5 

Ala He Ser Pro Pro Asp Ala Ala Ser Ala Ala Pro Leu Arg Thr He 
120 125 130 

Thr Ala Asp Thr Phe Arg Lys Leu Phe Arg Val Tyr Ser Asn Phe Leu 
135 140 145 

Arq Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala Cys Arg Thr Gly Asp 
150 * 155 160 165 

Arg 

(2) INFORMATION FOR SEQ ID NO : 3 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 

<B) CLONE: EPOlong 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 5 : 

AGCTTGCCCG GGATGAGGGC CACCGGTGTG GTCACCCGGC GCGCCCCAGG TCGCTGAGGG 60 

ACCCCGGCCA GGCGCGGAGA TGGGGGTGCA CGAATGTCCT GCCTGGCTGT GGCTTCTCCT 12 0 

GTCCCTGCTG TCGCTCCCTC TGGGCCTCCC AGTCCTGGGC GCCCCACCAC GCCTCATCTG 180 

TGACAGCCGA GTCCTGGAGA GGTACCTCTT GGAGGCCAAG GAGGCCGAGA ATATCACGAC 240 

GGGCTGTGCT GAACATTGCA GCTTGAATGA GAATATCACT GTCCCAGACA CCAAAGTTAA 3 00 

TTTCTATGCC TGGAAGAGGA TGGAGGTCGG GCAGCAGGCC GTAGAAGTCT GGCAGGGCCT 3 60 

GGCCCTGCTG TCGGAAGCTG TCCTGCGGGG CCAGGCCCTG TTGGTCAACT CTTCCCAGCC 42 0 

GTGGGAGCCC CTGCAGCTGC ATGTGGATAA AGCCGTCAGT GGCCTTCGCA GCCTCACCAC 480 

TCTGCTTCGG GCTCTGGGAG CCCAGAAGGA AGCCATCTCC CCTCCAGATG CGGCCTCAGC 540 

TGCTCCACTC CGAACAATCA CTGCTGACAC TTTCCGCAAA CTCTTCCGAG TCTACTCCAA 600 

TTTCCTCCGG GGAAAGCTGA AGCTGTACAC AGGGGAGGCC TGCAGGACAG GGGACAGATG 66 0 

ACCAGGTGTG TCCACCTGGG CATATCCACC ACCTCCCTCA CCAATATTGC TTGTGC C AC A 72 0 

CCCTCCCCCG CCACTCCTGA ACCCCGTCGA GGGGCTCTCA GCTCAGCGCC AGCCTGTCCC 7 80 

ATGGACCA 7 8 8 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Homo sapiens 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Rh 32 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 6 : 
AAAAAGATGA GGTAATTGTG TTTTTATAAT TAAATATTTT ATAATTAAAA TATTTATAAT 60 

TAAAATATTT ATAATTAAAT ATTTTATAAT TAAAATATTT ATAATTAAAT ATTTTATAAT 12 0 

TAAAATATTT ATAATTAAAT ATTTTATAAT TAAAATATTT ATAATTAAAT ATTTTATAAT 180 

TAAAATATTT ATAATTAAAT ATTTTATAAT TAAAATATTT ATAATTAAAT ATTTTATAAT 24 0 

TAAAATATTT ATAATTAAAT ATTTTATAAT TAAAATGTTT ATAATTACAT ATTTTATAAT 300 

TAAAATGTTT ATAATTACAT ATTTTATAAT TAAAATGTTT ATAATTACAT ATTTTATAAT 3 60 
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TAAAATGTTT ATAATTACAT ATTTTATAAT TACATATTTT ATAAAGTATT TATAATTACA 420 

TATTTTATAA TTAAAGTATT TATAATTACA TATTTTATAA TTAAAGTATT TATAATTACA 480 

TATTTTATAA TTCAATATTT T AT AAAT AG T TAAAAAGACG AGGAAAAAAT TAAAAAGACG 540 

AGGTTATTGA TCTCAGGAAT TGTATTTGCC AAGTGAGAAG GAAAAAATAT TCACAAAGGC 600 

, TTGTA 605 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2529 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(vii) IMMEDIATE SOURCE: 
<B) CLONE: Rh 10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

TCTAGACCCC AGTTTCTCTA TAAGATGAGA ATATTAGTCA CGATTTGGTT TCTAAGATCC 60 

TGTCTATGTT TGAGACTACA GATACCTGTT GCTACATTTC CCTTCATAGC TCTGAACAAG 12 0 

GAGAATTCAG CCCAATTCTC ATGGCCTTCT AAACAATCCA GAGTTTCAGT GCCATAAGGT 180 

ACTACAATTT AGTGTCAAAT TAAGTCAAAG GCTTCATTAG CCTGAAAGCT CTGTCCCTGG 240 

CCTGGGCATG GCAAACTGTA TCCCCCACTG ACCATCCCCC TGTCTCCCTT CTCCCCAGAG 300 

ACTCCAGTAG CCTGGCGTCA TCACAGGGGC CAGACATATC CAACATGTTC CCAGCTTCCT 3 60 

GCCACTTGAC TTTCAGTGTG CCTCCCTCTT CAGTTACCCA AATCCTGCCC ACCATTCCAG 420 

AGCCAGTTCA ATCTCACCCA TCCAGGACCC CCGAGACCCC CATCGTACCA CTATAGTCTA 480 

ACTGTGGTGT AGACCCCACA C TGGGC AC AT TGCGTACGCT CATTATTGGC TGTGACGTCT 54 0 

GATTATGCCC TTCTCCTGGT CTGGAAGCTC TCGGAGGTGC TCCATAATAC ATGAAGAGAA 600 

GTAGTGCTGG TGTGGGAATA GTGAGGTGTG TTTATCCATC CAGCTATCCG GCACCAGCAC 66 0 

TGGTCTCAGC TTTCTGAGGT AACACGTTCT GAGCCTTAGT CTTGAGAGAA CATAAAGAAA 7 20 

ACTTTTTTTA AAAGTAGTAA AAAGTGGCTG ACAAAAGCTG ACCAAAAGCC TTCAAAAGAA 780 

ATGCTAAGTT ATATCTAAGA AAGTTT AC C C AAGGTCAGGC AAA T ATG AAA CCTAAAGCTA 84 0 

GACGTGGGGA AGAACTTCCG GAGAGTTGCA ATTCCCTGTG CCCCAGCATC CCCAGGAGGG 900 

CATGCCCACA TCTGATTTAG AAATCTGTGT AAAATGAGTG AAGGTTTCTA TTTCTTGGGC 960 

AGTGTGGGCA CAGGTCTTTG GAG AGGTC G A TGGCCTCCCA TAAAATCCTT CCTGCTTGAT 102 0 

GGTTCTGGAT CCTCAGCCAC AGCTCCTAAT AGCCATGAGG TTTGAGCCCA AAATAATTTA 108 0 

TGTGTTTGTT TTTTCAGCCC CAAAATTTCC ATAGAATCAA AGTAGTCAGA GCTGAATGGG 1140 
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GCTAAGAGAC CGTCCATTCC TGTCTTCTCA TCACAGATGA GGGACTGCCA CCCAGAGCCG 12 00 
TAGAAACTGT CCCATGGCCC CAGTTCCCAG ACCCTTCCTC TCTCCTACAG CTCCAAGTTC 12 60 
ACTGTGCATT CTAAATGAAG ATGTAAACAT AGGCAGCAAC ACTCAAGAGT AAAAATGAAG 13 20 
TGTGCATATG AAAGAAACCT ATTCACATGG ACCATATTAC ATTATAATCA CAGTGTTTAC 13 80 
TGCTTGACTA CCATCTGCCT GGGCTAGCAA GGGTGTCAGT GAGGAAGAGA GGACAAGGGG 14 40 
TACCAATCTG TGAACTACAC ATGGTTCTTG CTCTCCCAGC TTCTCTCTCC CATTGGCAAG 1500 
GCAACAGGTA AACACATGAA AAATCAAATA ATGCTATAAG AGAAAAATGT ATTCAGGACA 1560 
ACAACAGGTT TGTATGAAGG CCTTTCATCA TCGTTGTCCT ACCTAGAAAC TGAATGACAG 1620 
GGAATCAGAG TCACAAGCTA TGAAGTCTAA CTGGGCTGGT C C C AGAG AAA GATTCAGTGC 1680 
AGTAGGTGGG GCTGCAGCCA GCCCTGGGTG GGTGGAAGGA TGACATCCAC ATAGGCAAGA 1740 
GGGTGATAAT TCACTTGCGC AGCTCCTCAC TGCACATTGA ACCCTGCTGA CTTCTGGCTT 1800 
CTCTCCCGGG AGGAACTGCG ACTCAACATT CTGACCTTAT CTCTTGGGTA GCAGAATGAT 1860 
GGAGAAGGAA AGTTTCTTTT TGCTTCTCGC AGGGGTTAAT CATCCATCTG GAATGCCTAC 192 0 
ATTTGGTTGA CAATGGCTCA CCCTATCATC TTCCTCCTGA ACCATTCACC TAAATGTGCC 1980 
ATTTCTTTCC TGATAGTTCT CATTTGTGTG TGTGTGTGTG TGTGTGTGTG TGCACGTGCT 2 040 
CACACATGCA TGCTGTCACT GGGTAAACAG GCCACCCTGG GCACAGTTCC ATCTACAATG 2100 
TTTGAAGTTT ACTTTCCAGC TTCTGGGCAT CATTTGCAAT TATAATGCTG TCACAGGCAG 2160 
AAACGAGATA GGCTAATTAA TCGTTGTCAA TACTGATCCC TATTTGCCAG ATGAGATTTT 2 220 
GGAGC AGCAT GGCTGGGAAT AATTGGTATA GACTGTATTT CCTTGCTTTA TGTCACTGGA 22 8 0 
AATATTTATT TAAGCATCAC GGTCGCTATG CATAAATATC CTGGAAAATG GGGTATAGCT 2 34 0 
GAATGGTGCA GATTCATTCA TTCATATTCA GCAAATTATG TTCTAAGCAC CTACTTCAGT 2400 
ACGTGAACAG C AC TAAACTC AGAATATTGG TCTGCTGGGG TCCTTTATTA GCTTCCATGA 246 0 
TTCCCTGAAC TTGGC C AAG A CCCTTCTGGT CGGCTGCAGA TAGGCACAAT GGATAGTTTT 2 520 
GCTTCTAGA 2 52 9 
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I CIAIM 

1. A recombinant DNA molecule adapted for transfection 
of a host cell comprising a nucleic acid molecule encoding 
mammalian erythropoietin, an expression control sequence 

5 operatively linked thereto and at least one SAR element. 

2. A recombinant DNA molecule as claimed in claim 1 
wherein the nuclecic acid molecule encodes human 
erythropoeitin . 

3. a recombinant DNA molecule as claimed in claim 2 
10 wherein the nucleic acid molecule encodes erythropoietin 

having the amino acid sequence shown in SEQ ID NO 34. 

4. A recombinant DNA molecule as claimed in claim 3 
wherein the nucleic acid molecule has the sequence as 
shown in SEQ ID NO. 35. 

15 5. a recombinant DNA molecule as claimed in claim 1 

wherein the SAR element is a SAR element which co-maps 
with a chromatin domain boundary. 

6. A recombinant DNA molecule as claimed in claim 5 
wherein the SAR element is a human apolipoprotein SAR 

20 element. 

7. A recombinant DNA molecule as claimed in claim 5 
wherein the SAR element comprises the sequence as shown in 
SEQ ID NO. 3 6 or 37 . 

8. A recombinant DNA molecule as claimed in claim 1 
25 wherein the nucleic acid molecule encoding mammalian 

erythropoietin and the expression control sequence 
operatively linked thereto are flanked by SAR elements. 

9. An expression vector comprising a recombinant DNA 
molecule as claimed in claim 1. 
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10. An expression vector comprising a nucleic acid 
molecule encoding erythropoietin and having the sequence 
shown in SEQ ID NO. 35 under the control of a promoter, 
flanked by 5' and 3' apolipoprotein SAR elements. 

5 11. An expression vector as claimed in claim 10 wherein 
the promoter is a human cytomegalovirus IE enhancer and 
promoter, or an elongation factor-1 alpha promoter. 

12. A mammalian cell transformed with an expression 
vector as claimed in claim 9. 

10 13 . A mammalian cell lacking a selectable gene capable 
of being amplified, said cell being capable of expressing 
recombinant EPO in vitro at levels of at least 2,000 u/10 6 
cells in 24 hours. 

14. A method of expressing recombinant mammalian 
15 erythropoietin comprising the steps of culturing a 

mammalian cell as claimed in claim 13 in a suitable medium 
until sufficient amounts of erythropoietin are produced by 
the cell and separating the erythropoietin produced. 

15. A method of preparing recombinant erythropoietin 
20 comprising transfecting a mammalian cell with an 

expression vector comprising a recombiant DNA molecule as 
claimed in claim 1, and culturing the transfected cell in 
a suitable medium until sufficient amounts of 
erythropoietin are produced by the cell and separating the 
25 erythropoietin produced. 

16. A method as claimed in claim 14 wherein the 
mammalian cell is further transfected with a selectable 
marker gene and wherein transfected cells are selected by 
means of the selectable marker gene. 

30 17. A method as claimed in claim 16 wherein the 
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selectable marker gene is neo. 

18. A method as claimed in claim 14 comprising the 
additional step of identifying and selecting cells 
producing high levels of erythropoietin; subcloning the 
5 selected cells; establishing long term cell lines from the 
selected cells and; culturing the selected cells in a 
suitable medium until sufficient amounts of erythropoietin 
are produced by the cell and separating the erythropoietin 
produced. 

10 19. a method as claimed in claim 14 wherein 
erythropoietin is produced at levels of at least 1,500 
u/10 6 cells in 24 hours in the absence of gene 
amplification . 

20. Erythropoietin produced by the method of claim 14. 

15 21. A transgenic non-human animal or embryo whose germ 
cells and somatic cells contain a DNA construct comprising 
the recombinant DNA molecule as claimed in claim 1. 
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FIGURE 3A 
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FIGURE 3B 

MGVHECPAWLWLLLSLLSLPLGLPVLGAPPRLICDSRVLERYLLEAKEAENITTGCAEHC 
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HVDKAVSGLRSLTTLLRALGAQKEAISPPDAASAAPLRTITADTFRKLFRVYSNFLRGKL 
KLYTGEACRTGDR 
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FIGURE 4 

AGCTTGCCCG GGATGAGGGC CACCGGTGTG GTCACCCGGC GCGCCCCAGG TCGCTGAGGG 60 

ACCCCGGCCA GGCGCGGAGA TGGGGGTGCA CGAATGTCCT GCCTGGCTGT GGCTTCTCCT 12 0 

GTCCCTGCTG TCGCTCCCTC TGGGCCTCCC AGTCCTGGGC GCCCCACCAC GCCTCATCTG 180 

TGACAGCCGA GTCCTGGAGA GGTACCTCTT GGAGGCCAAG GAGGCCGAGA ATATCACGAC 2 40 

GGGCTGTGCT GAACATTGCA GC TTGAATG A GAATATCACT GTCCCAGACA CCAAAGTTAA 3 00 

TTTCTATGCC TGGAAGAGGA TGGAGGTCGG GCAGCAGGCC GTAGAAGTCT GGCAGGGCCT 3 60 

GGCCCTGCTG TCGGAAGCTG TCCTGCGGGG CCAGGCCCTG TTGGTCAACT CTTCCCAGCC 42 0 

GTGGGAGCCC CTGCAGCTGC ATGTGGATAA AGCCGTCAGT GGCCTTCGCA GCCTCACCAC 4 80 

TCTGCTTCGG GCTCTGGGAG CCCAGAAGGA AGCCATCTCC CCTCCAGATG CGGCCTCAGC 54 0 

TGCTCCACTC CGAACAATCA CTGCTGACAC TTTCCGCAAA CTCTTCCGAG TCTACTCCAA 600 

TTTCCTCCGG GGAAAGCTGA AGCTGTACAC AGGGGAGGCC TGCAGGACAG GGGACAGATG 66 0 

ACCAGGTGTG TCCACCTGGG CATATCCACC ACCTCCCTCA CCAATATTGC TTGTGCCACA 720 

CCCTCCCCCG CCACTCCTGA ACCCCGTCGA GGGGCTCTCA GCTCAGCGCC AGCCTGTCCC 7 80 

ATGGACCA - Q Q 
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FIGURE 5 

1 ***** CATGA GCTAATTCTC TTTTTATAAT TAAATATTTT ATAATTAAAA TATTTATAAT 
TTTTTCTACT CCATTAACAC AAAAATATTA ATTTATAAAA TATTAATTTT IJaIIJa^I 

61 T A ±!± TATTT ATAATTAAAT ATTTTATAAT TAAAATATTT ATAATTAAAT ATTTTATAAT 
ATTTTATAAA TATTAATTTA TAAAATATTA ATTTTATAAA TATTAATTTA TAAAAtItoI 

121 T A ±^T ATTT ATAATTAAAT ATTTTATAAT TAAAATATTT ATAATTAAAT ATTTTATAAT 
ATTTTATAAA TATTAATTTA TAAAATATTA ATTTTATAAA TATTAATTTA TAAAATATTA 

181 T A £!£T A ,T TT ATAATTAAAT ATTTTATAAT TAAAATATTT ATAATTAAAT ATTTTATAAT 
ATTTTATAAA TATTAATTTA TAAAATATTA ATTTTATAAA TATTAATTTA TAAAATATTA 

241 TA AAA T ATTT ATAATTAAAT ATTTTATAAT TAAAATGTTT ATAATTAAAT ATTTTATAAT 
ATTTTATAAA TATTAATTTA TAAAATATTA ATTTTACAAA TATTAATTTA TAAAATATTA 

301 T£ A £ A . TGTTT ATAATTACAT ATTTTATAAT TAAAATGTTT ATAATTACAT ATTTTATAAT 
ATTTTACAAA TATTAATGTA TAAAATATTA ATTTTACAAA TATTAATGTA TAAAATATtI 

361 T A ±^ TGTTT ATAATTACAT ATTTTATAAT TAAAATGTTT ATAATTACAT ATTTTATAAT 
ATTTTACAAA TATTAATGTA TAAAATATTA ATTTTACAAA TATTAATGTA TAAAATATTA 

421 T£££ A . TGTTT ATAATTACAT ATTTTATAAT TACATATTTT ATAAAGTATT TATAATTACA 
ATTTTACAAA TATTAATGTA TAAAATATTA ATGTATAAAA TATTTCATAA ATATTAATGT 

481 TATTTTATAA TTAAAGTATT TATAATTACA TATTTTATAA TTAAAGTATT TATAATTACA 
ATAAAATATT AATTTCATAA ATATTAATGT ATAAAATATT AATTTCATAA ATATTAATC? 

541 TATTTTATAA TTCAATATTT TATAAATAGT TAAAAAGACG AGGAAAAAAT TAAAAAGACG 
ATAAAATATT AAGTTATAAA ATATTTATCA ATTTTTCTGC TCCTTTTTTA ATTTTTCTGC 

601 AG GTT ATTG A TCTCAGGAAT TGTATTTGCC AAGTGAGAAG GAAAAAATAT TCACAAAGGC 
TCCAATAACT AGAGTCCTTA ACATAAACGG TTCACTCTTC CTTTTTTATA AGTGTTTCCG 

661 TTGTA 
AACAT 
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FIGURE 6 



1 TCTAGACCCC AGTTTCTCTA TAAGATGAGA ATATTAGTCA CGATTTGGTT TCTAAGATCC 
61 TGTCTATGTT TGAGACTACA GATACCTGTT GCTACATTTC CCTTCATAGC TCTGAACAAG 
121 GAGAATTCAG CCCAATTCTC ATGGCCTTCT AAACAATCCA GAGTTTCAGT GCCATAAGGT 
181 ACTACAATTT AGTGTCAAAT TAAGTCAAAG GCTTCATTAG CCTGAAAGCT CTGTCCCTGG 
241 CCTGGGCATG GCAAACTGTA TCCCCCACTG ACCATCCCCC TGTCTCCCTT CTCCCCAGAG 
301 ACTCCAGTAG CCTGGCGTCA TCACAGGGGC CAGACATATC CAACATGTTC CCAGCTTCCT 
361 GCCACTTGAC TTTCAGTGTG CCTCCCTCTT CAGTTACCCA AATCCTGCCC ACCATTCCAG 
421 AGCCAGTTCA ATCTCACCCA TCCAGGACCC CCGAGACCCC CATCGTACCA CTATAGTCTA 
2?i ^I2 TGGTGT AGACCCCACA C TGGGC AC AT TGCGTACGCT CATTATTGGC TGTGACGTCT 
541 GATTATGCCC TTCTCCTGGT CTGGAAGCTC TCGGAGGTGC TCCATAATAC ATGAAGAGAA 
601 GTAGTGCTGG TGTGGGAATA GTGAGGTGTG TTTATCCATC CAGCTATCCG GCACCAGCAC 
661 TGGTCTCAGC TTTCTGAGGT AACACGTTCT GAGCCTTAGT CTTGAGAGAA CATAAAGAAA 
721 ACTTTTTTTA AAAGTAGTAA AAAGTGGCTG ACAAAAGCTG ACCAAAAGCC TTCAAAAGAA 
781 ATGCTAAGTT ATATCTAAGA AAGTTTACCC AAGGTCAGGC AAATATGAAA CCTAAAGCTA 
841 GACGTGGGGA AGAACTTCCG GAGAGTTGCA ATTCCCTGTG CCCCAGCATC CCCAGGAGGG 
901 CATGCCCACA TCTGATTTAG AAATCTGTGT AAAATGAGTG AAGGTTTCTA TTTCTTGGGC 
961 AGTGTGGGCA CAGGTCTTTG GAGAGGTCGA TGGCCTCCCA TAAAATCCTT CCTGCTTGAT 
1021 GGTTCTGGAT CCTCAGCCAC AGCTCCTAAT AGCCATGAGG TTTGAGCCCA AAATAATTTA 
1081 TGTGTTTGTT TTTTCAGCCC CAAAATTTCC ATAGAATCAA AGTAGTCAGA GCTGAATGGG 
1141 GC TAAG AG AC CGTCCATTCC TGTCTTCTCA TCACAGATGA GGGACTGCCA CCCAGAgScS 
1201 TAGAAACTGT CCCATGGCCC CAGTTCCCAG ACCCTTCCTC TCTCCTACAG CTCCAAGTTC 
12 61 ACTGTGCATT CTAAATGAAG ATGTAAACAT AGGCAGCAAC ACTCAAGAGT AAAAATGAAG 
1321 TGTGCATATG AAAGAAACCT ATTCACATGG ACCATATTAC ATTATAATCA CAGTGTTTAC 
2°™™ CCATCTGCCT GGGCTAGCAA GGGTGTCAGT GAGGAAGAGA GGACAAGGGG 
1441 TACCAATCTG TG AAC T AC AC ATGGTTCTTG CTCTCCCAGC TTCTCTCTCC CATTGGCAAG 
1501 GCAACAGGTA AACACATGAA AAATCAAATA ATGCTATAAG AGAAAAATGT ATTCAGGACA 
\c->\ ^ AA ^ AGGTT TGTATGAAGG CCTTTCATCA TCGTTGTCCT ACCTAGAAAC TGAATGACAG 
1621 GGAATCAGAG TCACAAGCTA TGAAGTCTAA CTGGGCTGGT CCCAGAGAAA GATTCAGTGC 
1681 AGTAGGTGGG GCTGCAGCCA GCCCTGGGTG GGTGGAAGGA TGACATCCAC ATAGGCAAGA 
1741 GGGTGATAAT TCACTTGCGC AGCTCCTCAC TGCACATTGA ACCCTGCTGA CTTCTGGCTT 
1801 CTCTCCCGGG AGGAACTGCG ACTCAACATT CTGACCTTAT CTCTTGGGTA GCAGAATGAT 
1861 GGAGAAGGAA AGTTTCTTTT TGCTTCTCGC AGGGGTTAAT CATCCATCTG GAATGCCTAC 
1921 ATTTGGTTGA CAATGGCTCA CCCTATCATC TTCCTCCTGA ACCATTCACC TAAATGTGCC 
1981 ATTTCTTTCC TGATAGTTCT CATTTGTGTG TGTGTGTGTG TGTGTGTGTG TGCACGTGCT 
2 041 CACACATGCA TGCTGTCACT GGGTAAACAG GCCACCCTGG GCACAGTTCC ATCTACAATG 
2101 TTTGAAGTTT ACTTTCCAGC TTCTGGGCAT CATTTGCAAT TATAATGCTG TCACAGGCAG 
2161 AAACGAGATA GGCTAATTAA TCGTTGTCAA TACTGATCCC TATTTGCCAG ATGAGATTTT 
2221 GGAGCAGCAT GGCTGGGAAT AATTGGTATA GACTGTATTT CCTTGCTTTA TGTCACTGGA 
2281 AATATTTATT TAAGCATCAC GGTCGCTATG CATAAATATC CTGGAAAATG GGGTATAGCT 
2 341 GAATGGTGCA GATTCATTCA TTCATATTCA GCAAATTATG TTCTAAGCAC CTACTTCAGT 
2401 ACGTGAACAG CACTAAACTC AGAATATTGG TCTGCTGGGG TCCTTTATTA GCTTCCATGA 
24 61 TTCCCTGAAC TTGGCCAAGA CCCTTCTGGT CGGCTGCAGA TAGGCACAAT GGATAGTTTT 
2 521 GCTTCTAGA 
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